(57) Abstract: The invention relates to heterodimeric polypeptide conjugates exhibiting FSH
activity, comprising a dimeric polypeptide comprising an FSH-α subunit and an FSH-β subunit,
wherein at least one of the FSH-α and FSH-β subunits differs from the corresponding wildtype
subunit in that at least one amino acid residue comprising an attachment group for a
non-polypeptide moiety has been introduced or removed, and having at least one
non-polypeptide moiety bound to an attachment group of at least one of said subunits.
Preferably, at least one attachment group, e.g. an N- or O-glycosylation site or an
attachment site for a polymer molecule such as polyethylene glycol, has been introduced, e.g.
at an N-terminal. The polypeptide conjugates exhibit improved properties, in particular an
increased half-life, compared to human FSH.

FOLLICLE STIMULATING HORMONES

Field of the invention

The present invention relates to new polypeptides and polypeptide conjugates
exhibiting follicle stimulating hormone (FSH) activity, to methods for preparing such
polypeptides and conjugates, and to the use of such polypeptides and conjugates in therapy,
in particular in the treatment of infertility.

Background of the invention 

Follicle Stimulating Hormone (FSH) is a dimeric hormone consisting of an α
subunit and a β subunit. The α subunit is common to the glycoprotein hormone family, which
apart from FSH includes chorionic gonadotropin (CG), thyroid stimulating hormone (TSH),
and luteinizing hormone (LH), whereas the β subunit is specific to FSH. The human wildtype
α subunit is a 92 amino acid glycoprotein, the amino acid sequence of which is shown in SEQ
ID NO:2. Said subunit is referred to herein as hFSH-α. The human wildtype β subunit is a
111 amino acid glycoprotein that has the amino acid sequence shown in SEQ ID NO:4. This
subunit is referred to herein as hFSH-β.

Human FSH (hFSH) has been isolated from pituitary glands and from
post-menopausal urine (EP 322 438) and has been produced recombinantly in mammalian cells
(US 5,639,640, US 5,156,957, US 4,923,805, US 4,840,896, US 5,767,251, EP 211,894 and
EP 521,586). The latter references also disclose the hFSH-β gene. US 5,405,945 discloses a
modified human α subunit gene comprising only one intron.

US 4,589,402 and US 4,845,077 disclose purified hFSH which is free of LH
and the use thereof for in vitro fertilization. EP 322 438 discloses a protein with at least
6200 U/mg FSH activity which is substantially free of LH activity, and wherein the FSH α
subunit and β subunit, respectively, may be wildtype or specified truncated forms thereof.

Liu et al., J Biol Chem 1993, 15;268(2):21613-7, Grossmann et al., Mol Endocrinol
1996 10(6):769-79, Roth and Dias, Mol Cell Endocrinol 1995 1;109(2):143-9,
Valove et al., Endocrinology 1994;135(6):2657-61, Yoo et al., J Biol Chem 1993 25;
268(18):13034-42, US 5,508,261 and Chappel et al., 1998, Human Reproduction, 13(3):18-35
disclose various structure-function relationship studies and identify amino acid residues
involved in receptor binding and activation and in dimerization of FSH.

It has been found that glycosylation of FSH-α and FSH-β is essential for receptor
signal transduction. hFSH-α comprises two N-glycosylation sites at the asparagines
located at positions 52 and 78, whereas hFSH-β comprises two N-glycosylation sites at the
asparagines located at positions 7 and 24. The importance of the various N-glycosylation
sites for the binding and signal-transducing activities of FSH is discussed, inter alia, by
Valove et al., Endocrinology 1994;135(6):2657-61 and Flack et al., J Biol Chem 1994
13;269(19):14015-20.

Galway et al., Endocrinology 1990;127(1):93-100 demonstrate that FSH variants
produced in an N-acetylglucosamine transferase-I CHO cell line or a CHO cell line
defective in sialic acid transport are as active in vitro as FSH secreted by wildtype cells or
purified pituitary FSH, but lack in vivo activity, presumably due to rapid clearance of the
inadequately glycosylated variants in serum. D'Antonio et al., Human Reprod 1999;14(5):1160-7
describe various FSH isoforms circulating in the blood stream. The isoforms have identical
amino acid sequences, but differ in their extent of post-translational modification. It was
found that the less acidic isoform group had a faster in vivo clearance as compared with the
acidic isoform group, possibly due to differences in the sialic acid content between the
isoforms.

US 5,087,615 discloses a method for stimulating follicle development and
ovulation in a female patient by administering FSH to said patient during the follicular phase
of the ovulatory cycle, the improvement comprising initially administering a first FSH
isoform having a relatively long plasma half-life and subsequently administering a second FSH
isoform having a shorter plasma half-life.

Bishop et al. Endocrinology 1995; 136(6):2635-40 conclude that circulatory 
half-life appears to be the primary determinant of in vivo activity. 

Attempts have been made to prolong the serum half-life of FSH. US 5,338,835
and US 5,585,345 disclose a modified FSH-β subunit extended at the C-terminal Glu with the
carboxy terminal portion (CTP) region of hCG (the region consisting of the amino acid
sequence which occurs from positions 112-118 to 145, and comprising four O-linked
glycosylation sites located at positions 121, 127, 132 and 138). The resulting modified subunit
is stated to have the biological activity of native FSH, but a prolonged circulating half-life.
US 5,405,945 discloses that the carboxy terminal portion of the CG β subunit or a variant
thereof has significant effects on the clearance of CG, FSH, and LH.

US 5,883,073 discloses single-chain proteins comprised of two α-subunits with
agonist or antagonist activity for CG, TSH, LH and FSH.

US 5,508,261 discloses heterodimeric polypeptides having binding affinity to
LH and FSH receptors comprising a glycoprotein hormone α subunit and a non-naturally
occurring β subunit polypeptide, wherein the β subunit polypeptide is a chain of amino acids
comprising four joined subsequences, each of which is selected from a list of specific
sequences.

US 5,567,422 and WO 98/32466 mention FSH among a vast number of other 
therapeutic proteins that may be PEGylated. 

Currently, FSH is used therapeutically to stimulate the growth and maturation
of ovarian follicles in infertile women. In particular, FSH is used in connection with in vitro
fertilization as well as for the treatment of anovulatory women with anovulatory syndrome or
luteal phase deficiency. However, one problem encountered in current FSH treatment is the
fairly short in vivo half-life of FSH, requiring frequent, usually daily, administration of the
product. The frequent administration is very inconvenient for the patient and results in high
fluctuations of FSH activity in the blood stream, which may cause inadequate maturation of
the follicles.

Therefore, a clinical need exists for a product which provides part or all of the
therapeutically relevant effects of FSH, which may be administered at less frequent intervals
than currently available FSH products, and which preferably provides a more
stable level of circulating FSH activity than that obtainable by current treatment.
The present invention is directed to such products as well as the means of making such
products.

Brief disclosure of the invention

More specifically, the present invention relates to polypeptide conjugates
exhibiting FSH activity and methods for their preparation and their use in medical treatment.

Accordingly, in its first aspect the invention relates to a heterodimeric polypeptide
conjugate exhibiting FSH activity, comprising i) a dimeric polypeptide comprising an
FSH-α subunit and an FSH-β subunit, wherein at least one of said FSH-α and FSH-β subunits
differs from the corresponding wildtype subunit in that at least one amino acid
residue comprising an attachment group for a non-polypeptide moiety has been introduced or
removed, and ii) at least one non-polypeptide moiety bound to an attachment group of at least
one of said subunits.

In another aspect the invention relates to a heterodimeric polypeptide conjugate
exhibiting FSH activity, comprising i) a dimeric polypeptide comprising an FSH-α subunit
and an FSH-β subunit, wherein the amino acid sequence of at least one of said FSH-α and
FSH-β subunits differs from that of the corresponding wildtype subunit in that at least one
N-glycosylation site has been introduced, and ii) at least one oligosaccharide moiety bound to
an N-glycosylation site of at least one of said subunits.

In a further aspect, the invention relates to a heterodimeric polypeptide
conjugate exhibiting FSH activity, comprising a dimeric polypeptide comprising FSH-α and FSH-β
subunits, wherein at least one of said FSH-α and FSH-β subunits comprises, relative to the
corresponding wildtype subunit, at least one introduced N- or O-glycosylation site at the
N-terminal thereof, said at least one introduced glycosylation site being glycosylated.

In the above aspects the corresponding wildtype subunits are preferably
hFSH-α and hFSH-β, respectively.

Another aspect of the invention relates to a heterodimeric polypeptide
conjugate exhibiting FSH activity, comprising a dimeric polypeptide comprising an FSH-α subunit
and an FSH-β subunit, wherein at least one of said FSH-α and FSH-β subunits comprises a
polymer molecule bound to the N-terminal thereof.

In a further aspect the invention relates to modified FSH-α and modified
FSH-β polypeptides that may be used as intermediate products for the preparation of a conjugate
with a polymer molecule.

In still further aspects the invention relates to methods for preparing a
conjugate or a polypeptide of the invention, including nucleotide sequences and expression
vectors encoding a polypeptide or a conjugate of the invention.

In final aspects the invention relates to a composition comprising a conjugate
or polypeptide of the invention and methods of treating a mammal with such composition. In
particular, the polypeptide, conjugate or composition of the invention may be used to treat
infertility.

Description of the drawing and sequence listing

Figure 1 shows a sequence alignment of human FSH to the structural part of 
two published structures of human chorionic gonadotropin. 

SEQ ID NO:1 is the complete amino acid sequence of the common α chain,
the "glycoprotein hormones α chain" (Fiddes et al., Nature 281:351-356 (1979)). Rathnam et
al., J. Biol. Chem. 250:6735-6746 (1975) report residue Q29 to be a Glu. Sairam et al., Can.
J. Biochem. 55:755-760 (1977), and Sairam et al., Biochem. Biophys. Res. Commun.
48:530-537 (1972) report the sequence CS at positions 108-109 to be SC. FSH-α variants having
these changes are intended to be encompassed by the term "FSH-α" as used herein.

SEQ ID NO:2 is the mature amino acid sequence of the common α chain
shown in SEQ ID NO:1.

SEQ ID NO:3 is the complete amino acid sequence of the human FSH β chain
(Tanzi et al., DNA 6:205-212 (1987)).

SEQ ID NO:4 is the mature amino acid sequence of the human FSH β chain
shown in SEQ ID NO:3.

SEQ ID NO:5, SEQ ID NO:6 and SEQ ID NO:7 are DNA sequences of plasmids
described in the Examples.

Detailed disclosure of the invention 

Definitions

In the context of the present application and invention the following definitions
apply:

The term "conjugate" is intended to indicate a heterogeneous molecule formed
by the covalent attachment of one or more polypeptides to one or more non-polypeptide
moieties such as polymer molecules, oligosaccharide moieties, lipophilic compounds,
carbohydrate moieties or organic derivatizing agents. The term covalent attachment means that
the polypeptide and the non-polypeptide moiety are either directly covalently joined to one
another, or else are indirectly covalently joined to one another through an intervening moiety
or moieties, such as a bridge, spacer, or linkage moiety or moieties. Preferably, the conjugate
is soluble at relevant concentrations and conditions, i.e. soluble in physiological fluids such
as blood. The term "non-conjugated polypeptide" may be used about the polypeptide part of the
conjugate.

The term "polypeptide" may be used interchangeably herein with the term
"protein". Further, the terms "polypeptide" and "protein" are generally used herein for the
sake of simplicity to refer to the heterodimeric FSH polypeptides/proteins and conjugates of
the invention, even though these proteins strictly speaking comprise a dimer of the α and β
polypeptide subunits. The individual subunits are referred to herein as FSH-α and FSH-β,
respectively, so that it is clear from the context whether reference is made to the dimeric
hormone or to one of the subunits.

The "polymer molecule" is a molecule formed by covalent linkage of two or
more monomers, wherein none of the monomers is an amino acid residue, except where the
polymer is human albumin or another abundant plasma protein. The term "polymer" may be
used interchangeably with the term "polymer molecule". The term is intended to cover
carbohydrate molecules attached by in vitro glycosylation. Carbohydrate molecules attached by
in vivo glycosylation, such as N- or O-glycosylation (as further described below), are referred
to herein as "an oligosaccharide moiety". Except where the number of polymer molecules is
expressly indicated, every reference to "a polymer", "a polymer molecule", "the polymer" or
"the polymer molecule" contained in a polypeptide of the invention or otherwise used in the
present invention shall be a reference to one or more polymer molecule(s).

The term "attachment group" is intended to indicate an amino acid residue
group of the polypeptide capable of coupling to the relevant non-polypeptide moiety. For
instance, for polymer conjugation to PEG, a frequently used attachment group is the ε-amino
group of lysine or the N-terminal amino group. Other polymer attachment groups include a
free carboxylic acid group (e.g. that of the C-terminal amino acid residue or of an aspartic
acid or glutamic acid residue), suitably activated carbonyl groups, oxidized carbohydrate
moieties and mercapto groups. Useful attachment groups and their matching non-peptide
moieties are apparent from the table below.



| Attachment group | Amino acid | Examples of non-peptide moiety | Conjugation method/Activated PEG | Reference |
|---|---|---|---|---|
| -NH2 | N-terminal, Lys, His, Arg | Polymer, e.g. PEG, with amide or imine group | mPEG-SPA; Tresylated mPEG | Shearwater Inc.; Delgado et al., Critical Reviews in Therapeutic Drug Carrier Systems 9(3,4):249-304 (1992) |
| -COOH | C-term, Asp, Glu | Polymer, e.g. PEG, with ester or amide group; Oligosaccharide moiety | mPEG-Hz; In vitro coupling | Shearwater Inc. |
| -SH | Cys | Polymer, e.g. PEG, with disulfide, maleimide or vinyl sulfone group; Oligosaccharide moiety | PEG-vinylsulphone, PEG-maleimide; In vitro coupling | Shearwater Inc.; Delgado et al., Critical Reviews in Therapeutic Drug Carrier Systems 9(3,4):249-304 (1992) |
| -OH | Ser, Thr, -OH, Lys | Oligosaccharide moiety; PEG with ester, ether, carbamate, carbonate | In vivo O-linked glycosylation | |
| -CONH2 | Asn as part of an N-glycosylation site | Oligosaccharide moiety; Polymer, e.g. PEG | In vivo N-glycosylation | |
| Aromatic residue | Phe, Tyr, Trp | Oligosaccharide moiety | In vitro coupling | |
| -CONH2 | Gln | Oligosaccharide moiety | In vitro coupling | Yan and Wold, Biochemistry, 1984, Jul 31; 23(16):3759-65 |
| Aldehyde, Ketone | Oxidized oligosaccharide | Polymer, e.g. PEG, PEG-hydrazide | PEGylation | Andresz et al., 1978, Makromol. Chem. 179:301, WO 92/16555, WO 00/23114 |
| Guanidino | Arg | Oligosaccharide moiety | In vitro coupling | Lundblad and Noyes, Chemical Reagents for Protein Modification, CRC Press Inc., Florida, USA |
| Imidazole ring | His | Oligosaccharide moiety | In vitro coupling | As for guanidine |

For in vivo N-glycosylation, the term "attachment group" is used in an
unconventional way to indicate the amino acid residues constituting an N-glycosylation site
(with the sequence N-X'-S/T/C-X", wherein X' is any amino acid residue except proline, X" is
any amino acid residue which may or may not be identical to X' and which preferably is
different from proline, N is asparagine, and S/T/C is either serine, threonine or cysteine,
preferably serine or threonine, and most preferably threonine). Although the asparagine
residue of the N-glycosylation site is where the oligosaccharide moiety is attached during
glycosylation, such attachment cannot be achieved unless the other amino acid residues of the
N-glycosylation site are present. Accordingly, when the non-peptide moiety is an
oligosaccharide moiety and the conjugation is to be achieved by N-glycosylation, the term
"amino acid residue comprising an attachment group for the non-peptide moiety" as used in
connection with alterations of the amino acid sequence of the polypeptide of interest is to be
understood as meaning that one or more amino acid residues constituting an N-glycosylation
site are to be altered in such a manner that either a functional N-glycosylation site is
introduced into the amino acid sequence or removed from said sequence.
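
By way of illustration only, the site definition N-X'-S/T/C-X" given above can be sketched as a small Python search routine. The function name, the `strict_x2` flag (which additionally rejects proline at X", reflecting the stated preference) and the toy sequence are assumptions made for this sketch and are not taken from the present disclosure or the sequence listing.

```python
import re


def find_n_glycosylation_sites(sequence, strict_x2=False):
    """Return 1-based positions of Asn residues that start an N-X'-S/T/C-X'' motif.

    X' may be any residue except proline. X'' may be any residue; with
    strict_x2=True (a hypothetical option) proline is also rejected at X''.
    A motif at the very C-terminus that lacks an X'' residue is not reported.
    """
    tail = "[^P]" if strict_x2 else "."
    # The lookahead makes overlapping motifs visible to finditer.
    pattern = re.compile(r"(?=N[^P][STC]" + tail + ")")
    return [match.start() + 1 for match in pattern.finditer(sequence.upper())]


# Made-up peptide used only to exercise the function (not an FSH sequence).
toy_sequence = "AANGTAANPTAANKSP"
sites = find_n_glycosylation_sites(toy_sequence)
strict_sites = find_n_glycosylation_sites(toy_sequence, strict_x2=True)
```

In the toy sequence the motif starting at N8 is skipped because X' is proline, and the motif starting at N13 is dropped in strict mode because X" is proline. Whether a predicted site is actually glycosylated in a given host cell is a separate, experimental question.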

In the present application, amino acid names and atom names (e.g. CA, CB,
NZ, N, O, C, etc.) are used as defined by the Protein DataBank (PDB) (www.pdb.org), which
is based on the IUPAC nomenclature (IUPAC Nomenclature and Symbolism for Amino Acids
and Peptides (residue names, atom names etc.), Eur. J. Biochem., 138, 9-37 (1984)
together with their corrections in Eur. J. Biochem., 152, 1 (1985)). The term "amino acid
residue" is primarily intended to indicate an amino acid residue contained in the group
consisting of the 20 naturally occurring amino acids, i.e. alanine (Ala or A), cysteine (Cys or
C), aspartic acid (Asp or D), glutamic acid (Glu or E), phenylalanine (Phe or F), glycine (Gly
or G), histidine (His or H), isoleucine (Ile or I), lysine (Lys or K), leucine (Leu or L),
methionine (Met or M), asparagine (Asn or N), proline (Pro or P), glutamine (Gln or Q),
arginine (Arg or R), serine (Ser or S), threonine (Thr or T), valine (Val or V), tryptophan
(Trp or W), and tyrosine (Tyr or Y) residues.

The terminology used for identifying amino acid positions/substitutions is
illustrated as follows: E9(a) indicates position number 9 occupied by a glutamic acid residue in
the amino acid sequence shown in SEQ ID NO:2. E9(a)N indicates that said glutamic acid
residue has been substituted by an asparagine residue. Unless otherwise indicated, the
numbering of amino acid residues made herein is made relative to the amino acid sequence
shown in SEQ ID NO:2 (for FSH-α, indicated by "(a)") or SEQ ID NO:4 (for FSH-β, indicated by
"(b)"). Multiple substitutions are indicated with a "+", e.g. M109(b)N+E111(b)S/T means an
amino acid sequence which comprises substitution of the methionine residue in position 109
of FSH-β by an asparagine residue and substitution of the glutamic acid residue in position
111 in FSH-β by a serine or a threonine residue.
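
The substitution nomenclature just described lends itself to mechanical parsing. The following Python sketch is offered as an illustration only; the `Substitution` record and the `parse_substitutions` function are hypothetical names introduced here, and the parser covers only substitutions written in the form shown above (it does not attempt insertions or deletions).

```python
import re
from typing import List, NamedTuple, Tuple


class Substitution(NamedTuple):
    wildtype: str                  # one-letter code of the residue being replaced
    position: int                  # numbered relative to SEQ ID NO:2 (a) or SEQ ID NO:4 (b)
    subunit: str                   # "a" for FSH-alpha, "b" for FSH-beta
    replacements: Tuple[str, ...]  # alternatives written with "/" in the notation


# One token looks like E9(a)N or E111(b)S/T.
_TOKEN = re.compile(r"^([A-Z])(\d+)\(([ab])\)([A-Z](?:/[A-Z])*)$")


def parse_substitutions(spec: str) -> List[Substitution]:
    """Split a "+"-joined specification into individual substitutions."""
    parsed = []
    for token in spec.split("+"):
        match = _TOKEN.match(token.strip())
        if match is None:
            raise ValueError(f"unrecognised substitution: {token!r}")
        wildtype, position, subunit, replacements = match.groups()
        parsed.append(
            Substitution(wildtype, int(position), subunit, tuple(replacements.split("/")))
        )
    return parsed


example = parse_substitutions("M109(b)N+E111(b)S/T")
```

Here `example` holds two records: methionine 109 of the β subunit replaced by asparagine, and glutamic acid 111 of the β subunit replaced by either serine or threonine.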

The term "nucleotide sequence" is intended to indicate a consecutive stretch of
two or more nucleotide molecules. The nucleotide sequence may be of genomic, cDNA,
RNA, semisynthetic, synthetic origin, or any combination thereof.

The term "polymerase chain reaction" or "PCR" generally refers to a method 
for amplification of a desired nucleotide sequence in vitro, as described, for example, in US 
4,683,195. In general, the PCR method involves repeated cycles of primer extension synthe- 
sis, using oligonucleotide primers capable of hybridising preferentially to a template nucleic 
acid. 

"Cell", "host cell", "cell line" and "cell culture" are used interchangeably 
herein and all such terms should be understood to include progeny resulting from growth or 
culturing of a cell. "Transformation" and "transfection" are used interchangeably to refer to 
the process of introducing DNA into a cell. 

"Operably linked" refers to the covalent joining of two or more nucleotide se- 
quences, by means of enzymatic ligation or otherwise, in a configuration relative to one an- 
other such that the normal function of the sequences can be performed. For example, the nu- 
cleotide sequence encoding a presequence or secretory leader is operably linked to a nucleo- 
tide sequence for a polypeptide if it is expressed as a preprotein that participates in the secre- 
tion of the polypeptide: a promoter or enhancer is operably linked to a coding sequence if it 
affects the transcription of the sequence; a ribosome binding site is operably linked to a cod- 
ing sequence if it is positioned so as to facilitate translation. Generally, "operably linked" 
means that the nucleotide sequences being linked are contiguous and, in the case of a secre- 
tory leader, contiguous and in reading phase. Linking is accomplished by ligation at conven- 
ient restriction sites. If such sites do not exist, then synthetic oligonucleotide adaptors or link- 
ers are used, in conjunction with standard recombinant DNA methods. 

The term "introduce" refers to introduction of an amino acid residue compris- 
ing an attachment group for a non-polypeptide moiety, either by substitution of an existing 
amino acid residue or by insertion of an additional amino acid residue. The term "remove" 
refers to removal of an amino acid residue comprising an attachment group for a non- 
polypeptide moiety, either by substitution of the amino acid residue to be removed by another 
amino acid residue or by deletion (without substitution) of the amino acid residue to be re- 
moved. 

When substitutions are performed in relation to a parent polypeptide, they are
preferably "conservative substitutions", in other words substitutions performed within groups
of amino acids with similar characteristics, e.g. small amino acids, acidic amino acids, polar
amino acids, basic amino acids, hydrophobic amino acids and aromatic amino acids.

Preferred substitutions in the present invention may in particular be chosen 
from among the conservative substitution groups listed in the table below. 

Conservative substitution groups:

| 1 | Alanine (A)       | Glycine (G)       | Serine (S)     | Threonine (T) |
| 2 | Aspartic acid (D) | Glutamic acid (E) |                |               |
| 3 | Asparagine (N)    | Glutamine (Q)     |                |               |
| 4 | Arginine (R)      | Histidine (H)     | Lysine (K)     |               |
| 5 | Isoleucine (I)    | Leucine (L)       | Methionine (M) | Valine (V)    |
| 6 | Phenylalanine (F) | Tyrosine (Y)      | Tryptophan (W) |               |
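
As a purely illustrative aid, the six groups of the table can be encoded as a lookup so that a proposed substitution can be checked for membership in a common group. The names `CONSERVATIVE_GROUPS`, `conservative_group` and `is_conservative` are assumptions of this sketch. Cysteine and proline do not appear in the table, so the sketch treats any substitution involving them as non-conservative.

```python
# Group numbers and members exactly as listed in the table (one-letter codes).
CONSERVATIVE_GROUPS = {
    1: "AGST",
    2: "DE",
    3: "NQ",
    4: "RHK",
    5: "ILMV",
    6: "FYW",
}


def conservative_group(residue):
    """Return the group number for a one-letter residue code, or None if unlisted."""
    if len(residue) != 1:
        return None
    for number, members in CONSERVATIVE_GROUPS.items():
        if residue.upper() in members:
            return number
    return None


def is_conservative(wildtype, replacement):
    """True when both residues belong to the same group of the table."""
    group = conservative_group(wildtype)
    return group is not None and group == conservative_group(replacement)
```

For example, `is_conservative("D", "E")` is true (both in group 2), whereas `is_conservative("E", "N")` is false, since an acidic residue is replaced by a residue from group 3.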





The term "immunogenicity" as used in connection with a given substance is
intended to indicate the ability of the substance to induce a response from the immune system.
The immune response may be a cell or antibody mediated response (see, e.g., Roitt: Essential
Immunology (8th Edition, Blackwell) for further definition of immunogenicity). Normally,
reduced antibody reactivity will be an indication of reduced immunogenicity. The reduced
immunogenicity may be determined by use of any suitable method known in the art, e.g. in
vivo or in vitro.

The term "functional in vivo half-life" is used in its normal meaning, i.e. the
time at which 50% of the biological activity of the polypeptide or conjugate is still present in
the body/target organ, or the time at which the activity of the polypeptide or conjugate is 50%
of the initial value. As an alternative to determining functional in vivo half-life, "serum
half-life" may be determined, i.e. the time at which 50% of the dispensed polypeptide or
conjugate molecules is still present in the circulation/plasma/bloodstream. The magnitude of
serum half-life is usually a good indication of the magnitude of functional in vivo half-life.
Alternative terms to serum half-life include "plasma half-life", "circulating half-life", "serum
clearance", "plasma clearance" and "clearance half-life". The polypeptide or conjugate is
cleared by the action of one or more of the kidney, reticuloendothelial systems (RES), spleen
or liver, by FSH-receptor-mediated elimination, or by specific or non-specific proteolysis.
Normally, clearance depends on size (relative to the cutoff for glomerular filtration), charge,
attached carbohydrate chains, and the presence of cellular receptors for the protein. The
functional in vivo half-life and the serum half-life may be determined by any suitable method
known in the art as further discussed in the Examples section hereinafter.

WO 01/58493 PCT/DK01/00090

The term "increased" as used about the functional in vivo half-life or serum
half-life is used to indicate that the relevant half-life of the conjugate or polypeptide is
statistically significantly increased relative to that of a reference molecule, such as a
non-conjugated rhFSH (recombinant hFSH), e.g. Gonal-F® (available from Serono) or Puregon®
(available from Organon), as determined under comparable conditions. For instance, the relevant
half-life may be increased by at least about 25%, such as by at least about 50%, e.g. by at
least about 100%, 200% or 500%.
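
The percentage figures above can be made concrete with a short calculation. The Python sketch below assumes simple first-order (mono-exponential) elimination, which is an assumption of this illustration and not a statement of the present disclosure; the function names and the numerical values are hypothetical, and the sketch does not address the statistical significance that the definition of "increased" requires.

```python
import math


def half_life_from_rate(k_elim):
    """Half-life for mono-exponential decay A(t) = A0 * exp(-k_elim * t)."""
    if k_elim <= 0:
        raise ValueError("elimination rate constant must be positive")
    return math.log(2) / k_elim


def percent_increase(conjugate_half_life, reference_half_life):
    """Relative increase of a half-life over that of the reference molecule, in percent."""
    if reference_half_life <= 0:
        raise ValueError("reference half-life must be positive")
    return 100.0 * (conjugate_half_life - reference_half_life) / reference_half_life


# Hypothetical values in hours, chosen only to illustrate the thresholds.
reference = 24.0
conjugate = 36.0
increase = percent_increase(conjugate, reference)
meets_25_percent = increase >= 25.0
```

With these made-up numbers the increase is 50%, which would satisfy the "at least about 25%" and "at least about 50%" levels but not the 100% level.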

The term "renal clearance" is used in its normal meaning to indicate any
clearance taking place by the kidneys, e.g. by glomerular filtration, tubular excretion or
tubular elimination. Renal clearance depends on physical characteristics of the conjugate,
including size (diameter), symmetry, shape/rigidity and charge. Reduced renal clearance may be
established by any suitable assay, e.g. an established in vivo assay. Typically, renal
clearance is determined by administering a labelled (e.g. radioactive or fluorescent labelled)
polypeptide conjugate to a patient and measuring the label activity in urine collected from the
patient. Reduced renal clearance is determined relative to a corresponding reference
polypeptide, e.g. the corresponding non-conjugated polypeptide, a non-conjugated corresponding
wild-type polypeptide or another conjugated polypeptide (such as a conjugated polypeptide not
according to the invention), under comparable conditions.

In some cases, it will be preferred to obtain a clearance that is only slightly
reduced (i.e. total clearance by renal clearance, receptor-mediated clearance and/or other
clearance mechanisms), e.g. to increase the in vivo half-life from about 24 hours to about 3-4
days, while in other cases a longer half-life of e.g. about 6-7 days will be desired. As will be
explained in further detail below, the number and size of the polymer molecules may be
adapted in order to obtain a desired clearance, as well as other desired properties, suitable for
a given clinical indication. Preferably, the conjugate of the invention has a reduced clearance
of at least about 50%, such as at least about 75% or at least about 90%, as compared to the
corresponding non-conjugated polypeptide (such as hFSH or rhFSH) as determined under
comparable conditions.

Generally, activation of the receptor is coupled to receptor-mediated clearance
(RMC) such that binding of a polypeptide to its receptor without activation does not lead to
RMC, while activation of the receptor leads to RMC. The clearance is due to internalisation of
the receptor-bound polypeptide with subsequent lysosomal degradation. Reduced RMC may
therefore be achieved by designing the conjugate so as to be able to bind and activate a
sufficient number of receptors to obtain optimal in vivo biological response and avoid
activation of more receptors than required for obtaining such response, e.g. by substitution,
polymer conjugation or other modification of one or more amino acid residues located at or
near a receptor-binding site. This may be reflected in reduced in vitro bioactivity and/or
increased off-rate.

The term "FSH-α" is intended to indicate a polypeptide having qualitatively
similar functions or activities as the corresponding wildtype FSH α subunit, including the
capability of forming a dimeric polypeptide with an FSH β subunit (FSH-β), which dimeric
polypeptide exhibits FSH activity. Alternatively used terms include "FSH-α polypeptide",
"FSH-α subunit", and "modified FSH-α". Analogously, the term "FSH-β" is intended to
indicate a polypeptide having qualitatively similar functions or activities as the corresponding
wildtype FSH β subunit, including the capability of dimerizing with FSH-α and thereby
forming a dimeric polypeptide exhibiting FSH activity. Alternatively used terms include "FSH-β
polypeptide", "FSH-β subunit", and "modified FSH-β".

The term "exhibiting FSH activity" is intended to indicate that the conjugate or
polypeptide has one or more of the functions of wildtype FSH, in particular hFSH, including
the capability of binding to and activating an FSH receptor. The FSH activity is conveniently
assayed using the in vitro activity assay described in the Examples section below. The
conjugate or polypeptide "exhibiting" FSH activity is considered to have such activity when it
displays a measurable function, e.g. a measurable activity. The dimeric polypeptide exhibiting
FSH activity may also be termed "FSH molecule" herein.

Conjugate of the invention 

As stated above, in a first aspect the invention relates to a polypeptide
conjugate exhibiting FSH activity, comprising i) a polypeptide comprising FSH-α and FSH-β
subunits, wherein at least one of the FSH-α and FSH-β subunits differs from the corresponding
wildtype subunit in at least one introduced or removed amino acid residue comprising an
attachment group for a non-polypeptide moiety, and ii) a non-polypeptide moiety bound to an
attachment group of the polypeptide. Examples of amino acid residues that may be introduced
and/or removed are described in further detail in the following sections.

By removing and/or introducing an amino acid residue comprising an attachment
group for the non-polypeptide moiety, it is possible to specifically adapt the polypeptide
so as to make the molecule more susceptible to conjugation to the non-polypeptide moiety of
choice, to optimize the conjugation pattern (e.g. to ensure an optimal distribution of
non-polypeptide moieties on the surface of the FSH molecule and to ensure that only the
attachment groups intended to be conjugated are present in the molecule) and thereby obtain a
new conjugate molecule which has FSH activity and in addition one or more improved properties
as compared to FSH molecules available today, in particular increased functional in vivo
half-life and/or reduced clearance.

In the conjugate of the invention, one or both of the FSH subunits may be
modified according to the invention. For instance, the amino acid sequence of FSH-α may be
modified as described herein, whereas FSH-β is unmodified, and vice versa. Alternatively,
both FSH-α and FSH-β may be modified according to the invention.

While the FSH-α and/or FSH-β may be of any origin, they are in particular of
mammalian origin, and preferably of human origin. Accordingly, the corresponding wildtype
subunits referred to above are preferably hFSH-α and hFSH-β, respectively, with the amino
acid sequences shown in SEQ ID NO:2 and SEQ ID NO:4.

In a preferred embodiment one difference between the amino acid sequence of
FSH-α and/or FSH-β and the corresponding wildtype sequence is that at least one and
preferably more, e.g. 1-20, amino acid residues comprising an attachment group for the
non-polypeptide moiety have been introduced, by insertion or substitution, into the amino acid
sequence. Thereby, properties such as the molecular weight, shape, size and/or charge of the
conjugate can be optimised. Preferably, such amino acid residues are introduced in positions
occupied by an amino acid residue having more than 25%, more preferably more than 50%,
such as more than 75% of its side chain exposed at the surface of the molecule.

The term "one difference" as used in the present application is intended to allow 
for additional differences being present. Accordingly, in addition to the specified amino acid 
difference, other amino acid residues than those specified may be mutated. 

In one embodiment, one difference between the amino acid sequence of FSH-α
and/or FSH-β and that of the corresponding wildtype polypeptide is that at least one and
possibly more, e.g. 1-15, amino acid residues comprising an attachment group for the
non-polypeptide moiety have been removed, by substitution or deletion, from the amino acid
sequence. The amino acid residue to be removed is preferably one to which conjugation is
disadvantageous, e.g. an amino acid residue located at or near a functional site of the polypeptide
(since conjugation at such a site may result in inactivation or reduced FSH activity of the
resulting conjugate due to impaired receptor recognition). In the present context the term
"functional site" is intended to indicate one or more amino acid residues which are essential for or
otherwise involved in the function or performance of hFSH, in particular dimerization and/or
receptor binding and activation. Such amino acid residues are a part of a functional site. The
functional site may be determined by methods known in the art and is preferably identified by
analysis of a structure of the polypeptide complexed to a relevant receptor, such as the hFSH
receptor.

In another embodiment, the alteration of FSH-α and/or FSH-β embraces
removal as well as introduction of amino acid residues comprising an attachment group for the
non-polypeptide moiety of choice.

In order to avoid too much disruption of the structure and function of the FSH 
molecule, the total number of amino acid residues to be altered in accordance with the present 
invention will typically not exceed 20 for each individual subunit. Preferably, the polypeptide 
part of the conjugate of the invention or the dimeric polypeptide of the invention comprises an 
amino acid sequence which differs in a total of 1-20 amino acid residues from the amino acid 
sequences shown in SEQ ID NO:2 and/or SEQ ID NO:4, such as in 1-15 or 2-12 amino acid 
residues, e.g. in 3-10 amino acid residues. Thus, normally the polypeptide part of the conju- 
gate or the dimeric polypeptide of the invention comprises an amino acid sequence which in 
total differs from the amino acid sequences shown in SEQ ID NO:2 and/or SEQ ID NO:4 in 1, 
2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 amino acid residues. 
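
The counting of differences relative to the wildtype sequence can be illustrated with a minimal Python sketch. It assumes substitutions only (equal-length sequences), and the 12-residue fragment and the function names are hypothetical; the fragment is not SEQ ID NO:2 or SEQ ID NO:4.

```python
def list_substitutions(wildtype: str, variant: str, subunit: str) -> list:
    """Name each substituted position as <wt><position>(<subunit>)<new>, e.g. D3(a)N."""
    if len(wildtype) != len(variant):
        raise ValueError("this sketch only handles substitutions (equal lengths)")
    return [
        f"{wt}{pos}({subunit}){new}"
        for pos, (wt, new) in enumerate(zip(wildtype, variant), start=1)
        if wt != new
    ]


def within_limit(substitutions: list, max_changes: int = 20) -> bool:
    """True if the subunit differs from wildtype in 1 to max_changes residues."""
    return 1 <= len(substitutions) <= max_changes


# Hypothetical fragment used only for illustration.
wildtype = "APDVQDGPEGTL"
variant = "APNVSDGPEGTL"
subs = list_substitutions(wildtype, variant, "a")
print(subs, within_limit(subs))
```

Insertions and deletions would require an alignment step first, which this sketch deliberately leaves out.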

The FSH-α and/or FSH-β subunits of the dimeric polypeptide are preferably
any of the specific modified FSH-α and/or FSH-β polypeptides disclosed in the subsequent
sections having introduced and/or removed amino acid residues comprising an attachment
group for the relevant non-polypeptide moiety.

The amino acid residue comprising an attachment group for a non-polypeptide
moiety, whether it is removed or introduced, is selected on the basis of the nature of the
non-polypeptide moiety of choice and, in most instances, on the basis of the method in which
conjugation between the polypeptide and the non-polypeptide moiety is to be achieved. It will be
understood that in order to preserve a measurable function of the modified FSH-α and/or
FSH-β, amino acid residues to be modified (by deletion or substitution) are selected from
those amino acid residues which are not essential for providing a measurable activity.
Accordingly, amino acid residues to be modified are different from those required for subunit
dimerization and/or receptor binding or activation. The identity of such amino acid residues is
described in the art (e.g. references identified in the Background section above) or can be
determined by a person skilled in the art using methods known in the art.

In addition to the amino acid alterations disclosed herein aimed at introducing
and/or removing attachment sites for the non-polypeptide moiety, the FSH-α and/or FSH-β
subunits may comprise other amino acid alterations that need not be related to introduction or
removal of attachment sites, i.e. other substitutions, insertions or deletions. These may, for
example, include truncation of the N- and/or C-terminus by one or more amino acid residues,
or addition of one or more extra residues at the N- and/or C-terminus. Examples of such
additional amino acid changes include adding part of or the entire CTP region of hCG to the
C-terminus of FSH-α or introducing any other mutation (in particular selected among those
reported to enhance FSH activity and/or increase the functional in vivo half-life, cf. the
Background of the Invention section herein). In such cases, the amino acid sequence of the basic
polypeptide subunits, i.e. the sequence of the subunits excluding any introduced or removed
attachment sites, will typically have a degree of homology, compared to the relevant wildtype
sequence (normally hFSH-α or hFSH-β), of at least about 80%, more typically at least about
90%, such as at least about 95%. Amino acid sequence homology/identity is conveniently
determined from aligned sequences, using e.g. the ClustalW program or from the PFAM
families database version 4.0 (http://pfam.wustl.edu/) (Nucleic Acids Res. 1999 Jan 1; 27(1):260-2)
by use of GENEDOC version 2.5 (Nicholas, K.B., Nicholas H.B. Jr., and Deerfield, D.W. II.
1997 GeneDoc: Analysis and Visualization of Genetic Variation, EMBNEW.NEWS 4:14;
Nicholas, K.B. and Nicholas H.B. Jr. 1997 GeneDoc: Analysis and Visualization of Genetic
Variation).
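
As a rough illustration of how a percentage identity can be read off a pair of already aligned sequences, the following Python sketch may help. It is not the ClustalW or GeneDoc calculation: the convention used here (identical columns divided by all alignment columns, with gap columns counted as mismatches) is an assumption, and the aligned fragments are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Identical columns as a percentage of all columns in a pairwise alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    # Ignore columns where both sequences carry a gap.
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == gap and b == gap)]
    if not columns:
        raise ValueError("alignment has no informative columns")
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


# Hypothetical aligned fragments: one substitution and one gap in ten columns.
identity = percent_identity("APDVQDGPEG", "APNVQD-PEG")
print(identity)
```

Other conventions divide by the length of the shorter sequence or exclude gap columns, so a threshold such as "at least about 80%" should be read together with the alignment program actually used.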

Preferably, the conjugate of the present invention has one or more improved
properties as compared to hFSH, including increased functional in vivo half-life, increased
serum half-life, reduced renal clearance, reduced receptor-mediated clearance, reduced
immunogenicity and/or an increased bioavailability as compared to rhFSH (e.g. Gonal-F® or
Puregon®). Consequently, medical treatment with a conjugate of the invention offers advantages
over the currently available FSH compounds, in particular longer duration between injections.

Conjugate of the invention wherein the non-polypeptide moiety is an oligosaccharide moiety

It has been found that N-glycosylation is important for FSH activity and also 
that the extent and type of oligosaccharide moiety attached by in vivo glycosylation is impor- 
tant for functional in vivo half-life of the glycosylated FSH. In order to obtain a different, in- 
creased glycosylation it is desirable to introduce at least one glycosylation site. Accordingly, 
in a preferred aspect the invention relates to a heterodimeric polypeptide conjugate exhibiting 
FSH activity comprising a dimeric polypeptide comprising an FSH-α subunit and an FSH-β
subunit, wherein the amino acid sequence of at least one of the FSH-α and FSH-β subunits
differs from that of the corresponding wildtype subunit in that at least one N-glycosylation 
site has been introduced, and having at least one oligosaccharide moiety bound to an N- 
glycosylation site of at least one of the subunits. 

A suitable N-glycosylation site may be introduced by introducing, by substitu- 
tion or insertion, an asparagine residue in a position occupied by an amino acid residue having 
more than 25% of its side chain exposed at the surface of the polypeptide, which position does 
not have a proline residue located in position +1 or +3 therefrom. If the amino acid residue 
located in position +2 is a serine or threonine, no further amino acid substitution is required. 
However, if this position is occupied by a different amino acid residue, a serine or threonine 
residue needs to be introduced. 
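
The sequence rule described above can be sketched in Python as follows. This is a minimal illustration only: it does not check side chain exposure (which requires a structure), positions too close to the C-terminus to hold a residue at +2 are simply rejected, and the function name and the 12-residue fragment are hypothetical.

```python
from typing import List, Optional


def plan_n_glycosylation_site(seq: str, pos: int, hydroxy: str = "T") -> Optional[List[str]]:
    """Return the substitutions needed to create N-X-S/T with N at 1-based `pos`.

    Returns None if a proline at +1 or +3 rules the position out, or if
    there is no residue at +2 to carry the serine/threonine.
    """
    def residue(p: int) -> Optional[str]:
        return seq[p - 1] if 1 <= p <= len(seq) else None

    if residue(pos) is None or residue(pos + 2) is None:
        return None
    if residue(pos + 1) == "P" or residue(pos + 3) == "P":
        return None

    changes = []
    if residue(pos) != "N":
        changes.append(f"{residue(pos)}{pos}N")
    if residue(pos + 2) not in ("S", "T"):
        changes.append(f"{residue(pos + 2)}{pos + 2}{hydroxy}")
    return changes


# Hypothetical fragment used only for illustration.
fragment = "APDVQDGPEGTL"
print(plan_n_glycosylation_site(fragment, 9))  # T already present at +2
print(plan_n_glycosylation_site(fragment, 3))  # needs a second substitution
print(plan_n_glycosylation_site(fragment, 7))  # proline at +1
```

A single returned substitution corresponds to the case where position +2 already holds a serine or threonine; two substitutions correspond to the paired mutations.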

A preferred conjugate according to this embodiment is one which comprises a 
modified FSH-α subunit having an amino acid residue which differs from that of hFSH-α in
the introduction of at least one N-glycosylation site by means of a mutation selected from the 
group consisting of P2(a)N+V4(a)S, P2(a)N+V4(a)T, D3(a)N+Q5(a)S, D3(a)N+Q5(a)T, 
V4(a)N+D6(a)S, V4(a)N+D6(a)S, D6(a)N+P8(a)S, D6(a)N+P8(a)T, E9(a)N+T11(a)S,
E9(a)N, T11(a)N+Q13(a)S, T11(a)N+Q13(a)T, L12(a)N+E14(a)S, L12(a)N+E14(a)T,
E14(a)N+P16(a)S, E14(a)N+P16(a)T, P16(a)N+F18(a)S, P16(a)N+F18(a)T, F17(a)N, 
F17(a)N+S19(a)T, G22(a)N+P24(a)S, G22(a)N+P24(a)T, P24(a)N+L26(a)S, 
P24(a)N+L26(a)T, F33(a)N+R35(a)S, F33(a)N+R35(a)T, R42(a)N+K44(a)S, 
R42(a)N+K44(a)T, S43(a)N+K45(a)S, S43(a)N+K45(a)T, K44(a)N+T46(a)S, K44(a)N, 
K45(a)N+M47(a)S, K45(a)N+M47(a)T, T46(a)N+L48(a)S, T46(a)N+L48(a)T,
L48(a)N+Q50(a)S, L48(a)N+Q50(a)T, V49(a)N+K51(a)S, V49(a)N+K51(a)T,
Q50(a)N+N52(a)S, Q50(a)N+N52(a)T, V61(a)N+K63(a)S, V61(a)N+K63(a)T, 
K63(a)N+Y65(a)S, K63(a)N+Y65(a)T, S64(a)N+N66(a)S, S64(a)N+N66(a)T, 
Y65(a)N+R67(a)S, Y65(a)N+R67(a)T, V68(a)S, V68(a)T, R67(a)N+T69(a)S, R67(a)N,
T69(a)N+M71(a)S, T69(a)N+M71(a)T, M71(a)N+G73(a)S, M71(a)N+G73(a)T,
G72(a)N+F74(a)S, G72(a)N+F74(a)T, G73(a)N+K75(a)S, G73(a)N+K75(a)T, 
F74(a)N+V76(a)S, F74(a)N+V76(a)T, K75(a)N+E77(a)S, K75(a)N+E77(a)T, 
A81(a)N+H83(a)S, A81(a)N+H83(a)T, H83(a)N, T86(a)N+Y88(a)S, T86(a)N+Y88(a)T, 
Y88(a)N+H90(a)S, Y88(a)N+H90(a)T, Y89(a)N+K91(a)S, Y89(a)N+K91(a)T, H90(a)N and 
H90(a)N+S92(a)T (positions with more than 25% side chain exposure). Among these possible 
positions for mutation, more preferred mutations are those where a glycosylation site can be 
introduced by mutation of a single amino acid residue, i.e. selected from the group consisting 
of V68(a)S, V68(a)T, E9(a)N, F17(a)N, K44(a)N, R67(a)N, H83(a)N and H90(a)N. 

More preferably, a glycosylation site is introduced at a position having more 
than 50% side chain exposure, i.e. by means of a mutation selected from the group consisting 
of P2(a)N+V4(a)S, P2(a)N+V4(a)T, D3(a)N+Q5(a)S, D3(a)N+Q5(a)T, V4(a)N+D6(a)S, 
V4(a)N+D6(a)S, D6(a)N+P8(a)S, D6(a)N+P8(a)T, E9(a)N+T11(a)S, E9(a)N,
T11(a)N+Q13(a)S, T11(a)N+Q13(a)T, E14(a)N+P16(a)S, E14(a)N+P16(a)T,
P16(a)N+F18(a)S, P16(a)N+F18(a)T, F17(a)N, F17(a)N+S19(a)T, G22(a)N+P24(a)S, 
G22(a)N+P24(a)T, K45(a)N+M47(a)S, K45(a)N+M47(a)T, T46(a)N+L48(a)S, 
T46(a)N+L48(a)T, L48(a)N+Q50(a)S, L48(a)N+Q50(a)T, V49(a)N+K51(a)S,
V49(a)N+K51(a)T, Q50(a)N+N52(a)S, Q50(a)N+N52(a)T, K63(a)N+Y65(a)S, 
K63(a)N+Y65(a)T, S64(a)N+N66(a)S, S64(a)N+N66(a)T, V68(a)S, V68(a)T, 
R67(a)N+T69(a)S, R67(a)N, T69(a)N+M71(a)S, T69(a)N+M71(a)T, G72(a)N+F74(a)S, 
G72(a)N+F74(a)T, G73(a)N+K75(a)S, G73(a)N+K75(a)T, K75(a)N+E77(a)S, 
K75(a)N+E77(a)T, T86(a)N+Y88(a)S, T86(a)N+Y88(a)T, Y89(a)N+K91(a)S, 
Y89(a)N+K91(a)T, H90(a)N, and H90(a)N+S92(a)T. Still more preferably, glycosylation 
sites are introduced via mutation of a single amino acid residue selected from the group con- 
sisting of E9(a)N, F17(a)N, R67(a)N, and H90(a)N.

The FSH-β part of such conjugates with an altered FSH-α subunit may be
hFSH-β or any of the modified FSH-β polypeptides described herein.

Alternatively or additionally, the conjugate according to this embodiment
comprises a modified FSH-β having an amino acid residue which differs from that of hFSH-β in
the introduction of at least one N-glycosylation site by a mutation selected from the group 
consisting of S2(b)N+E4(b)S, S2(b)N+E4(b)T, E4(b)N+T6(b)S, E4(b)N, L5(b)N+N7(b)S, 
L5(b)N+L7(b)T, T6(b)N+I8(b)S, T6(b)N+I8(b)T, I8(b)N+I10(b)S, I8(b)N+I10(b)T, 
T9(b)N+A11(b)S, T9(b)N+A11(b)T, K14(b)N+E16(b)S, K14(b)N+E16(b)T,
F19(b)N+I21(b)S, F19(b)N+I21(b)T, I21(b)N+I23(b)S, I21(b)N+I23(b)T, S22(b)N+N24(b)S,
S22(b)N+N24(b)T, Y31(b)N+Y33(b)S, Y31(b)N+Y33(b)T, Y33(b)N+R35(b)S, 
Y33(b)N+R35(b)T, R35(b)N+L37(b)S, R35(b)N+L37(b)T, D36(b)N+V38(b)S, 
D36(b)N+V38(b)T, L37(b)N+Y39(b)S, L37(b)N+Y39(b)T, K40(b)N+P42(b)S, 
K40(b)N+P42(b)T, A43(b)N+P45(b)S, A43(b)N+P45(b)T, P45(b)N+I47(b)S, 
P45(b)N+I47(b)T, K46(b)N+Q48(b)S, K46(b)N+Q48(b)T, I47(b)N+K49(b)S, 
I47(b)N+K49(b)T, K54(b)N+L56(b)S, K54(b)N+L56(b)T, E55(b)N+V57(b)S, 
E55(b)N+V57(b)T, L56(b)N+Y58(b)S, L56(b)N+Y58(b)T, V57(b)N+E59(b)S, 
V57(b)N+E59(b)T, Y58(b)N+T60(b)S, Y58(b)N, E59(b)N+V61(b)S, E59(b)N+V61(b)T, 
T60(b)N+R62(b)S, T60(b)N+R62(b)T, R62(b)N+P64(b)S, R62(b)N+P64(b)T, 
G65(b)N+A67(b)S, G65(b)N+A67(b)T, A67(b)N+H69(b)S, A67(b)N+H69(b)T, 
H68(b)N+A70(b)S, H68(b)N+A70(b)T, H69(b)N+D71(b)S, H69(b)N+D71(b)T, 
D71(b)N+L73(b)S, D71(b)N+L73(b)T, L73(b)N+T75(b)S, L73(b)N, T75(b)N+P77(b)S, 
T75(b)N+P77(b)T, H83(b)N+G85(b)S, H83(b)N+G85(b)T, K86(b)N+D88(b)S, 
K86(b)N+D88(b)T, D88(b)N+D90(b)S, D88(b)N+D90(b)T, S89(b)N, S89(b)N+S91(b)T, 
D90(b)N+T92(b)S, D90(b)N, S91(b)N+D93(b)S, S91(b)N+D93(b)T, D93(b)N+T96(b)S, 
D93(b)N, T95(b)N+R97(b)S, T95(b)N+R97(b)T, V96(b)N+G98(b)S, V96(b)N+G98(b)T, 
R97(b)N+L99(b)S, R97(b)N+L99(b)T, L99(b)N+P101(b)S, L99(b)N+P101(b)T, Y103(b)N, 
Y103(b)N+S105(b)T, S105(b)N+G107(b)S, S105(b)N+G107(b)T, F106(b)N+E108(b)S, 
F106(b)N+E108(b)T, G107(b)N+M109(b)S, G107(b)N+M109(b)T, E108(b)N+K110(b)S,
E108(b)N+K110(b)T, M109(b)N+E111(b)S, and M109(b)N+E111(b)T (mutations at
positions with at least 25% side chain exposure). Preferably, glycosylation sites are introduced by
means of mutation of a single amino acid residue selected from the group consisting of 
E4(b)N, Y58(b)N, L73(b)N, S89(b)N, D90(b)N, D93(b)N, and Y103(b)N. 

More preferably, a modified FSH-β has an amino acid residue which differs
from that of hFSH-β in the introduction of at least one N-glycosylation site by a mutation
selected from the group consisting of F19(b)N+I21(b)S, F19(b)N+I21(b)T, Y33(b)N+R35(b)S,
Y33(b)N+R35(b)T, A43(b)N+P45(b)S, A43(b)N+P45(b)T, P45(b)N+I47(b)S, 
P45(b)N+I47(b)T, K46(b)N+Q48(b)S, K46(b)N+Q48(b)T, I47(b)N+K49(b)S, 
I47(b)N+K49(b)T, K54(b)N+L56(b)S, K54(b)N+L56(b)T, E55(b)N+V57(b)S, 
E55(b)N+V57(b)T, V57(b)N+E59(b)S, V57(b)N+E59(b)T, Y58(b)N+T60(b)S, Y58(b)N, 
E59(b)N+V61(b)S, E59(b)N+V61(b)T, R62(b)N+P64(b)S, R62(b)N+P64(b)T, 
G65(b)N+A67(b)S, G65(b)N+A67(b)T, A67(b)N+H69(b)S, A67(b)N+H69(b)T,
H68(b)N+A70(b)S, H68(b)N+A70(b)T, H69(b)N+D71(b)S, H69(b)N+D71(b)T,
D71(b)N+L73(b)S, D71(b)N+L73(b)T, L73(b)N+T75(b)S, L73(b)N, T75(b)N+P77(b)S, 
T75(b)N+P77(b)T, H83(b)N+G85(b)S, H83(b)N+G85(b)T, K86(b)N+D88(b)S, 
K86(b)N+D88(b)T, D88(b)N+D90(b)S, D88(b)N+D90(b)T, S89(b)N, S89(b)N+S91(b)T, 
D90(b)N+T92(b)S, D90(b)N, S91(b)N+D93(b)S, S91(b)N+D93(b)T, T95(b)N+R97(b)S, 
T95(b)N+R97(b)T, R97(b)N+L99(b)S, R97(b)N+L99(b)T, L99(b)N+P101(b)S, 
L99(b)N+P101(b)T, Y103(b)N, Y103(b)N+S105(b)T, S105(b)N+G107(b)S, 
S105(b)N+G107(b)T, F106(b)N+E108(b)S, F106(b)N+E108(b)T, G107(b)N+M109(b)S, 
G107(b)N+M109(b)T, E108(b)N+K110(b)S, E108(b)N+K110(b)T, M109(b)N+E111(b)S,
and M109(b)N+E111(b)T (positions having more than 50% side chain accessibility). Among
these positions, it is preferred to introduce glycosylation sites using mutation of a single 
amino acid residue selected from the group consisting of Y58(b)N, L73(b)N, S89(b)N, 
D90(b)N, and Y103(b)N. 

The FSH-α part of such conjugates with an altered FSH-β subunit may be
hFSH-α or any of the modified FSH-α polypeptides described herein.

The FSH-α and/or FSH-β polypeptide may further differ from hFSH-α and/or
hFSH-β in at least one removed, naturally occurring N-glycosylation site. In particular,
FSH-α may comprise a substitution of N78(a) and/or T80(a) by any other amino acid residue
and/or FSH-β may comprise a substitution of N7(b), T9(b), N24(b) and/or T26(b) by any
other amino acid residue. Preferably, the N residue is substituted by Q or D, and the T residue
by A or G.

Furthermore, one or both of the FSH-α and FSH-β subunits of the conjugate
according to this embodiment (having at least one of the above mentioned N-glycosylation
site modifications) may differ from hFSH-α and hFSH-β, respectively, in the removal,
preferably by substitution, of at least one lysine residue. See the section below on removal of
lysine residues for further details.

An alternative embodiment of this aspect of the invention is one in which at
least one of said FSH-α and FSH-β subunits comprises at least one introduced N- or
O-glycosylation site at the N-terminal thereof, and wherein the at least one introduced
glycosylation site is glycosylated; see the discussion of peptide addition below. In this case, the
respective subunits may comprise one or more of the modifications disclosed elsewhere herein, or
one or both of the subunits may be the respective wildtype subunits, but having the at least
one introduced terminal glycosylation site. Thus, the polypeptide conjugate may be one in
which the FSH-α subunit comprises hFSH-α having the sequence shown in SEQ ID NO:2,
and/or in which the FSH-β subunit comprises hFSH-β having the sequence shown in SEQ ID
NO:4. In a particular embodiment, both of the subunits correspond to the respective wildtype
hFSH subunits, although with either the α or β subunit, or both, having an introduced
N-terminal glycosylation site.

The introduced glycosylation site may be of the type described elsewhere
herein; see the discussion of glycosylation under the general discussion of attachment groups
above. A non-limiting example of a suitable glycosylation site for introduction at the
N-terminal is the sequence Ala-Asn-Ile-Thr-Val-Asn-Ile-Thr-Val, e.g. for insertion of two
glycosylation sites upstream of a mature FSH-α or FSH-β sequence.

Introduction of glycosylation sites by means of peptide addition 

In addition to or as an alternative to introducing glycosylation sites within the
amino acid sequence of one or both of the subunits, one or more additional glycosylation sites
may be introduced by means of a "peptide addition" as discussed in the following. In this
case, each of the polypeptide subunits comprises or consists of or consists essentially of the
primary structure,

NH2-X-P-COOH or NH2-P-X-COOH,

wherein

X is a peptide addition comprising or contributing to a glycosylation site, and P
is the basic polypeptide subunit to be modified, i.e. FSH-α or FSH-β, e.g. a wildtype
polypeptide subunit as defined herein or a modified polypeptide having introduced and/or removed
glycosylation sites or other attachment sites in the mature part of the polypeptide.

In the context of a peptide addition the term "comprising a glycosylation site"
is intended to mean that a complete glycosylation site is present in the peptide addition,
whereas the term "contributing to a glycosylation site" is intended to cover the situation where
at least one amino acid residue of an N-glycosylation site is present in the peptide addition
while the other amino acid residue of said site is present in the polypeptide P, whereby the
glycosylation site can be considered to bridge the peptide addition and the polypeptide.

Usually, the peptide addition is fused to the N-terminal or C-terminal end of the
polypeptide P as reflected in the above shown structure so as to provide an N- or C-terminal
elongation of the polypeptide P, preferably at the N-terminal. However, it is also possible to
insert the peptide addition within the amino acid sequence of the polypeptide P whereby the
polypeptide comprises, consists of or consists essentially of the primary structure
NH2-Px-X-Py-COOH, wherein

Px is an N-terminal part of the relevant polypeptide P,

Py is a C-terminal part of said polypeptide P, and

X is a peptide addition comprising or contributing to a glycosylation site.

In order to minimize structural changes effected by the insertion of the peptide
addition within the sequence of the polypeptide P, it is desirable that it be inserted in a
non-structural part thereof. For instance, Px may be a non-structural N-terminal part of a mature
polypeptide P, and Py a structural C-terminal part of said mature polypeptide, or Px may be a
structural N-terminal part of a mature polypeptide P, and Py a non-structural C-terminal part of
said mature polypeptide.

The term "non-structural part" is intended to indicate a part of either the C- or 
N-terminal end of the folded polypeptide subunit that is outside the first structural element, 
such as an α-helix or a β-sheet structure. The non-structural part can easily be identified in a
three-dimensional structure or model of the polypeptide. If no structure or model is available, 
a non-structural part typically comprises or consists of the first or last 1-20 amino acid resi- 
dues, such as 1-10 amino acid residues of the amino acid sequence constituting the mature 
form of the polypeptide. 

When the peptide addition comprises only few amino acid residues, e.g. 1-5, 
such as 1-3 amino acid residues, and in particular one amino acid residue, the peptide addition 
can be inserted into a loop structure of the polypeptide P and thereby elongate the loop. 

In principle, the peptide addition X can be any stretch of amino acid residues 
ranging from a single amino acid residue to a mature protein. In the present context, it is con- 
templated that each peptide addition will normally comprise up to about 50 amino acid resi- 
dues, such as 2-30 or 3-20 amino acid residues. The peptide addition may be designed by a 
site-specific or random approach. In order to minimize the risk of an immunogenic response,
however, it is preferable to select N- or C-terminal extensions of the FSH sequence that
comprise peptide sequences that are part of naturally occurring human proteins. Non-limiting
examples of such peptide sequences include the sequence NSTQNATA, which corresponds to
positions 231 to 238 of the human calcium activated channel 2 precursor (to add two
N-glycosylation sites to FSH), or the sequence ANLTVRNLTRNVTV, which corresponds to
positions 538 to 551 of the human G protein coupled receptor 64 (to add three
N-glycosylation sites to FSH).

Typically, each peptide addition X comprises 1-10 glycosylation sites. The
peptide addition X may thus comprise 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 glycosylation sites. It is
well known that a frequently occurring consequence of modifying an amino acid sequence of,
e.g., a human protein is that new epitopes are created by such modification. Non-polypeptide
moieties may be used to shield any new epitopes created by the peptide addition, and
therefore it is desirable that sufficient glycosylation sites (or attachment groups for another
non-polypeptide moiety, e.g. a polymer such as PEG) are present to enable shielding of all
epitopes introduced into the sequence. This is e.g. achieved when the peptide addition X
comprises at least one glycosylation site within a stretch of 30 contiguous amino acid residues,
preferably at least one glycosylation site within 20 amino acid residues, more preferably at
least one attachment group within 10 amino acid residues, in particular 1-3 attachment groups
within a stretch of 10 contiguous amino acid residues in the peptide addition X.
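
One possible reading of the spacing criterion is that every stretch of the stated length within the peptide addition should contain at least one complete N-glycosylation site. The Python sketch below checks that reading; the interpretation, the function name and the second test peptide are assumptions made for illustration, and only N-X-S/T sites (X not proline) are considered.

```python
import re

# Lookahead so that overlapping N-X-S/T sites are all found.
SEQUON = re.compile(r"(?=N[^P][ST])")


def every_window_has_site(peptide: str, window: int) -> bool:
    """True if each stretch of `window` residues contains a complete N-X-S/T site."""
    starts = [m.start() for m in SEQUON.finditer(peptide)]
    if len(peptide) <= window:
        return bool(starts)
    for left in range(len(peptide) - window + 1):
        right = left + window
        if not any(left <= s and s + 3 <= right for s in starts):
            return False
    return True


print(every_window_has_site("ANLTVRNLTRNVTV", 10))
print(every_window_has_site("NITAAAAAAAAAAAAAAAAA", 10))  # hypothetical peptide
```

Attachment groups for other non-polypeptide moieties, such as lysine residues for PEG, would need their own pattern in place of `SEQUON`.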

Preferably, the glycosylation site of the peptide addition is an in vivo
glycosylation site, preferably an N-glycosylation site. For instance, the peptide addition X may have the
structure X1-N-X2-T/S/C-Z, wherein X1 is a peptide comprising at least one amino acid
residue or is absent, X2 is any amino acid residue different from P, and Z is absent or is a peptide
comprising at least one amino acid residue. For instance, X1 may be absent, X2 may be an amino
acid residue selected from the group consisting of I, A, G, V and S (all relatively small amino
acid residues), and Z may comprise at least 1 amino acid residue. Z can e.g. be a peptide
comprising up to 50 amino acid residues and e.g. up to 10 glycosylation sites.

Alternatively, X1 may comprise at least one amino acid residue, e.g. 1-50
amino acid residues with 1-10 glycosylation sites, X2 may be an amino acid residue selected
from the group consisting of I, A, G, V and S, and Z may be absent.

Examples of peptide additions for use in the present invention are 
ANITVNITV, NDTVNFT and NITVNITV; see Examples 9 and 10 below, which illustrate 
addition of these sequences at the N-terminal of the FSH-α and FSH-β subunits.
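
The number of N-glycosylation sites contributed by a given peptide addition can be checked with a short Python sketch. It assumes the N-X-S/T pattern with X different from proline (the T/S/C variant mentioned above is not covered), and the function name is hypothetical.

```python
import re

# N, then any residue except proline, then serine or threonine.
# The lookahead lets overlapping sites be counted separately.
SEQUON = re.compile(r"(?=N[^P][ST])")


def count_n_glycosylation_sites(peptide: str) -> int:
    """Count N-X-S/T sites lying entirely within the peptide."""
    return len(SEQUON.findall(peptide))


for peptide in ("NSTQNATA", "ANLTVRNLTRNVTV", "ANITVNITV", "NDTVNFT", "NITVNITV"):
    print(peptide, count_n_glycosylation_sites(peptide))
```

The counts for NSTQNATA and ANLTVRNLTRNVTV agree with the two and three sites stated for these sequences. Whether a site is actually glycosylated in a given host cell is not something a pattern match can show.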

The peptide addition can comprise one or more of these peptide sequences, i.e. 
at least two of said sequences either directly linked together or separated by one or more 
amino acid residues, or can contain two or more copies of any of these peptide sequences. It
will be understood that the above specific sequences are given for illustrative purposes and 
thus do not constitute an exhaustive list of peptide sequences of use in the present invention. 

In one embodiment, the peptide addition X has an N residue in position -2 or
-1, and the polypeptide P or Px has a T or an S residue in position +1 or +2, respectively, the
residue numbering being made relative to the N-terminal amino acid residue of P or Px,
whereby an N-glycosylation site is formed. For instance, the polypeptide may have a T or S
residue in position 2, preferably a T residue, and the peptide addition is AN or comprises AN
as the C-terminal amino acid residues.
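
A site that bridges the peptide addition and the polypeptide can be detected by inspecting the two three-residue windows that span the junction. The Python sketch below assumes an N-terminal fusion (X followed by P); the function name and the five-residue polypeptide fragments are hypothetical.

```python
def junction_creates_site(addition: str, polypeptide: str) -> bool:
    """True if fusing addition + polypeptide forms an N-X-S/T site across the junction."""
    fused = addition + polypeptide
    junction = len(addition)  # index of the first residue of the polypeptide
    # N at position -2 (S/T at +1) or N at position -1 (S/T at +2).
    for start in (junction - 2, junction - 1):
        if start < 0 or start + 3 > len(fused):
            continue
        n, x, hydroxy = fused[start:start + 3]
        if n == "N" and x != "P" and hydroxy in ("S", "T"):
            return True
    return False


# Hypothetical fragments: threonine versus proline at position 2.
print(junction_creates_site("AN", "ATDVQ"))
print(junction_creates_site("AN", "APDVQ"))
```

With the addition AN, the asparagine sits in position -1, so the site forms only when position 2 of the polypeptide is serine or threonine, as in the first call.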


O-glycosylation 

As an alternative or in addition to the mutations discussed above, the
heterodimeric polypeptide may comprise one or more introduced O-glycosylation sites, for
example the amino acid sequence AATPAP, which has been found to be an efficient signal
sequence for O-glycosylation in vivo (Asada et al. (1999) Glycoconj. J. 16(7):321-6). The
AATPAP sequence for O-glycosylation is preferably introduced by way of insertion at the N-
and/or C-terminus of the FSH-α and/or FSH-β subunit.

Preparation of glycosylated conjugates 

It will be understood that in order to prepare a conjugate according to this
aspect, the polypeptide must be expressed in a glycosylating host cell capable of attaching
oligosaccharide moieties at the glycosylation site(s) in vivo or alternatively subjected to in vitro
glycosylation. Examples of glycosylating host cells are given in the section further below
entitled "Coupling to an oligosaccharide moiety".

In addition to an oligosaccharide moiety, the conjugate according to the aspect
of the invention described in the present section may contain additional non-polypeptide
moieties different from O-linked or N-linked oligosaccharide moieties, in particular a polymer
molecule such as PEG as described herein conjugated to one or more attachment groups
present in the polypeptide part of the conjugate. This is particularly relevant when a lysine
residue (or any other amino acid residue comprising an attachment group for the polymer
molecule in question) has been introduced and/or removed.

It will be understood that any of the amino acid changes specified in this sec- 
tion can be combined with any of the amino acid changes specified in the other sections 
herein disclosing specific amino acid changes. 



Conjugate of the invention wherein the non-polypeptide moiety is attached to a lysine or the
N-terminal amino acid residue

In a further preferred embodiment the conjugate of the invention is one wherein 
the amino acid residue comprising an attachment group for the non-polypeptide moiety is a 
lysine residue and the non-polypeptide moiety is any molecule which has lysine as an attach- 
ment group. For instance, the non-polypeptide moiety may be a polymer molecule, in particu- 
lar any of the molecules mentioned in the section entitled "Conjugation to a polymer mole- 
cule", and preferably selected from the group consisting of linear or branched polyethylene 
glycol and polyalkylene oxide. Most preferably, the polymer molecule is mPEG-SPA or oxy- 
carbonyl-oxy-N-dicarboxyimide PEG (US 5,122,614). 

The FSH-α and/or FSH-β having introduced and/or removed at least one lysine
may advantageously be in vivo glycosylated, e.g. using naturally occurring glycosylation sites
present in the relevant FSH polypeptide. However, in a particular embodiment the conjugate
is one wherein the amino acid sequence of FSH-α and/or FSH-β differs from that of hFSH-α
and/or hFSH-β in that an N-glycosylation site has been introduced and/or removed. Such
introduced/removed sites may be any of those described in the section entitled "Conjugate of the
invention wherein the non-polypeptide moiety is an oligosaccharide moiety".

i) Removal of lysine residues 

hFSH-α contains 6 lysine residues and hFSH-β 7. In order to avoid conjugation
to one or more of these lysine residues, e.g. lysine residues located at or close to the
receptor-binding site of hFSH, it may be desirable to remove at least one lysine residue. Accordingly,
in one embodiment the conjugate of the invention is one which comprises a modified FSH-α
having an amino acid residue which differs from that of hFSH-α in the removal of at least one
lysine residue selected from the group consisting of K44(a), K45(a), K51(a), K63(a), K75(a),
and K91(a), in particular at least one amino acid residue selected from the group consisting
of K44(a), K45(a), K63(a), K75(a), and K91(a) (these residues having more than 25% of their
side chain exposed to the surface), and preferably from the group consisting of K45(a),
K63(a), K75(a), and K91(a) (these residues having more than 50% of their side chain exposed
to the surface). The FSH-β part of this conjugate may be hFSH-β or any of the modified
FSH-β polypeptides described herein.

In another embodiment the conjugate of the invention is one which comprises a
modified FSH-β having an amino acid residue which differs from that of hFSH-β in the
removal of at least one lysine residue selected from the group consisting of K14(b), K40(b),
K46(b), K49(b), K54(b), K86(b), and K110(b), in particular at least one amino acid residue
selected from the group consisting of K14(b), K40(b), K46(b), K49(b), K54(b), K86(b),
and K110(b) (these residues having more than 25% of their side chain exposed to the surface),
and preferably from the group consisting of K46(b), K54(b), K86(b), and K110(b) (these
residues having more than 50% of their side chain exposed to the surface). The FSH-α part of this
conjugate may be hFSH-α or any of the modified FSH-α polypeptides described herein.

In a further embodiment, the conjugate of the invention is one which comprises
a modified FSH-α and a modified FSH-β, each of which differs from the corresponding hFSH
subunit in the removal of at least one of the above identified lysine residues. For instance, the
conjugate of the invention may be one wherein the modified FSH-α and modified FSH-β
subunit differ from the corresponding hFSH subunit in at least one of K45(a), K63(a), K75(a),
and K91(a) and at least one of K46(b), K54(b), K86(b), and K110(b).

The removal of any of the above lysine residues is preferably achieved by sub- 
stitution by any other amino acid residue, in particular by an arginine or a glutamine residue. 
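
The residue labels and the substitution approach described above can be illustrated with a
short Python sketch. It parses labels of the form K45(a), lists the lysine positions of the
α subunit, and replaces the four most exposed lysines by arginine. The sequence string is an
assumption: it is not reproduced in this section and has been transcribed here from the
publicly known mature human glycoprotein hormone α subunit, so it should be checked against
SEQ ID NO:2 before any real use. The function names are hypothetical.

```python
import re

# Assumption: mature human glycoprotein hormone alpha subunit (92 residues),
# transcribed for illustration only; verify against SEQ ID NO:2.
HFSH_ALPHA = (
    "APDVQDCPEC" "TLQENPFFSQ" "PGAPILQCMG" "CCFSRAYPTP" "LRSKKTMLVQ"
    "KNVTSESTCC" "VAKSYNRVTV" "MGGFKVENHT" "ACHCSTCYYH" "KS"
)

# Labels look like "K45(a)": one-letter residue, 1-based position, subunit a or b.
LABEL = re.compile(r"([A-Z])(\d+)\(([ab])\)")


def parse_label(label):
    """Split a label such as 'K45(a)' into (residue, position, subunit)."""
    match = LABEL.fullmatch(label)
    if match is None:
        raise ValueError(f"unrecognised residue label: {label!r}")
    residue, position, subunit = match.groups()
    return residue, int(position), subunit


def substitute(sequence, label, new_residue, subunit="a"):
    """Return a copy of sequence in which the labelled residue is replaced."""
    residue, position, label_subunit = parse_label(label)
    if label_subunit != subunit:
        raise ValueError(f"{label} does not refer to subunit {subunit!r}")
    if sequence[position - 1] != residue:
        raise ValueError(f"{label}: found {sequence[position - 1]} at that position")
    return sequence[:position - 1] + new_residue + sequence[position:]


# 1-based positions of all lysines in the alpha subunit.
lysines = [i + 1 for i, aa in enumerate(HFSH_ALPHA) if aa == "K"]

# Remove the lysines described as more than 50% exposed by substituting arginine.
modified = HFSH_ALPHA
for label in ("K45(a)", "K63(a)", "K75(a)", "K91(a)"):
    modified = substitute(modified, label, "R")
```

Because `substitute` checks that the residue named in the label is actually present at the
stated position, a mistyped label or a mis-numbered sequence raises an error instead of
silently producing the wrong variant.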

ii) Introduction of lysine residues 

In order to obtain a more extensive conjugation it may be desirable to introduce 
at least one non-naturally occurring lysine residue in hFSH, in particular in a position occu- 
pied by an amino acid residue having a side chain which is more than 25% surface exposed 
and which is not part of a cystine or located at a receptor binding site. 

Accordingly, in a further embodiment the conjugate of the invention is one
which comprises a modified FSH-α having an amino acid sequence which differs from that of
hFSH-α in the introduction of at least one lysine residue in a position selected from the group
consisting of A1(a), P2(a), D3(a), V4(a), Q5(a), D6(a), P8(a), E9(a), T11(a), L12(a), Q13(a),
E14(a), P16(a), F17(a), Q20(a), P21(a), G22(a), A23(a), P24(a), L26(a), M29(a), F33(a),
R42(a), S43(a), T46(a), L48(a), V49(a), Q50(a), N52(a), V61(a), S64(a), Y65(a), N66(a),
R67(a), V68(a), T69(a), M71(a), G72(a), G73(a), F74(a), N78(a), T80(a), A81(a), H83(a),
S85(a), T86(a), Y88(a), Y89(a), H90(a), and S92(a), in particular selected from the group
consisting of A1(a), P2(a), D3(a), V4(a), Q5(a), D6(a), P8(a), E9(a), T11(a), Q13(a), E14(a),
P16(a), F17(a), Q20(a), P21(a), G22(a), A23(a), T46(a), L48(a), V49(a), Q50(a), N52(a),
S64(a), N66(a), R67(a), T69(a), G72(a), G73(a), T86(a), Y89(a), H90(a), and S92(a) (these
residues having more than 50% of their side chain exposed to the surface), and most
preferably in the position R42(a) and/or R67(a), such as R67(a). The FSH-β part of this
conjugate may be hFSH-β or any of the modified FSH-β polypeptides described herein.

In a further embodiment the conjugate of the invention is one which comprises
a modified FSH-β having an amino acid sequence which differs from that of hFSH-β in the
introduction of at least one lysine residue in a position selected from the group consisting of
N1(b), S2(b), E4(b), L5(b), T6(b), N7(b), I8(b), T9(b), E15(b), E16(b), R18(b), F19(b),
I21(b), S22(b), N24(b), Y31(b), Y33(b), R35(b), D36(b), L37(b), Y39(b), D41(b), P42(b),
A43(b), R44(b), P45(b), I47(b), E55(b), L56(b), V57(b), Y58(b), E59(b), T60(b), V61(b),
R62(b), P64(b), G65(b), A67(b), H68(b), H69(b), D71(b), L73(b), Y74(b), T75(b), T80(b),
Q81(b), H83(b), G85(b), D88(b), S89(b), D90(b), S91(b), D93(b), T95(b), V96(b), R97(b),
G98(b), L99(b), G100(b), Y103(b), S105(b), F106(b), G107(b), E108(b), M109(b), and
E111(b), in particular selected from the group consisting of N1(b), N7(b), T9(b), E15(b),
E16(b), R18(b), F19(b), N24(b), Y33(b), D41(b), P42(b), A43(b), R44(b), P45(b), I47(b),
E55(b), V57(b), Y58(b), E59(b), R62(b), P64(b), G65(b), A67(b), H68(b), H69(b), D71(b),
L73(b), T75(b), Q81(b), H83(b), D88(b), S89(b), D90(b), S91(b), T95(b), R97(b), G98(b),
L99(b), G100(b), Y103(b), S105(b), F106(b), G107(b), E108(b), M109(b), and E111(b) (these
residues having more than 50% of their side chain exposed to the surface), and most
preferably selected from the group consisting of R18(b), R35(b), R44(b), R62(b), and R97(b),
such as R18(b), R44(b), R62(b), and R97(b). The FSH-α part of this conjugate may be hFSH-α
or any of the modified FSH-α polypeptides described herein.

iii) Introduction and removal of lysine residues

The conjugate of the invention may comprise at least one introduced lysine
residue, in particular any of those described in the section entitled "Introduction of lysine
residues", and at least one removed lysine residue, in particular any of those described in the
section entitled "Removal of lysine residues".

Preferably, the conjugate comprises a modified FSH-α and/or a modified FSH-β
which differs from the corresponding hFSH-α/β in at least one introduced and at least one
removed lysine residue, wherein the lysine residue is introduced by substitution of an amino
acid residue selected from the group consisting of R42(a), R67(a), R18(b), R35(b),
R44(b), R62(b), and R97(b), and more preferably from the group consisting of R67(a),
R18(b), R44(b), R62(b), and R97(b), and the lysine residue is removed from a position selected
from the group consisting of K45(a), K63(a), K75(a), K91(a), K46(b), K54(b), K86(b), and
K110(b), the removal preferably being achieved by substitution by any other amino acid
residue, in particular by an arginine residue.

N-terminal PEGylation of FSH

As indicated above, one aspect of the invention relates to a polypeptide conjugate
wherein at least one of the FSH-α and FSH-β subunits comprises a polymer molecule
bound to the N-terminal thereof. Preferably, the polymer is a polyethylene glycol (PEG) such
as mPEG; see the general discussion below regarding conjugates comprising polyethylene
glycol-derived polymers.

In the case of N-terminal PEGylated FSH conjugates according to the invention,
the respective subunits may comprise one or more of the modifications disclosed elsewhere
herein, or one or both of the subunits may be the respective wildtype subunits with a
PEG-derived polymer being attached at the N-terminal. Thus, the polypeptide conjugate may
be one in which the FSH-α subunit comprises hFSH-α having the sequence shown in SEQ ID
NO:2, and/or in which the FSH-β subunit comprises hFSH-β having the sequence shown in
SEQ ID NO:4. In one embodiment, both of the subunits correspond to the respective wildtype
hFSH subunits, although with either the α or β subunit, or both, being N-terminally
PEGylated. In a preferred embodiment, however, at least one glycosylation site has been
introduced into one or both of the subunits as described in detail above. In cases where at least
one of the subunits has an N-terminally attached PEG molecule, it will often be desirable that
no other PEG molecules are attached, e.g., to a lysine residue. In such cases, the polypeptide
conjugate will thus comprise either one or two N-terminally attached PEG molecules as the
sole polymer molecule(s).

Aldehyde-activated PEG and reduction using NaBH3CN have been used to
selectively PEGylate the N-terminal α-amino group of proteins (see for instance US 5,824,784
regarding N-terminal PEGylation of G-CSF). The N-terminus of the α and/or the β chain of
wildtype FSH or a modified form of FSH can be PEGylated using similar methods. Reaction
materials include purified FSH or a modified form of FSH, methoxy-PEG-aldehyde
(M-PEG-CHO), and NaBH3CN. In order to optimise yield, one may for instance vary: molar
ratio of FSH, M-PEG-CHO and NaBH3CN, time for establishment of the Schiff's base
equilibrium (reaction between FSH and M-PEG-CHO before addition of NaBH3CN), reaction
time after addition of NaBH3CN, temperature, pH, or reaction volume. The yield of PEGylated
FSH forms may be analysed using Western blotting, mass spectrometry and N-terminal
sequencing. In order to restrict PEGylation to only one of the two N-termini in FSH,
PEGylation of the α or β chain may be selectively prevented by addition of a glutamine to the
N-terminus. Spontaneous cyclisation of such an N-terminal glutamine residue will render it
inaccessible for PEGylation. Such a glutamine residue may subsequently be removed using a
pyroglutamyl aminopeptidase (for instance EC 3.4.19.3).
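
By way of illustration only, the yield optimisation described above can be organised as a
full-factorial screen in which each reaction parameter is varied over a few levels. The
Python sketch below enumerates such a screen. All numerical levels are hypothetical
placeholders chosen for the example and are not conditions disclosed herein; reaction volume
is omitted for brevity.

```python
from itertools import product

# Hypothetical levels for each parameter named in the text (placeholders only).
FACTORS = {
    "peg_to_fsh_molar_ratio": [1, 5, 10],
    "nabh3cn_to_fsh_molar_ratio": [10, 50],
    "schiff_base_time_h": [0.5, 2.0],   # before NaBH3CN is added
    "reduction_time_h": [4, 16],        # after NaBH3CN is added
    "temperature_c": [4, 22],
    "ph": [5.0, 6.0],
}


def condition_grid(factors):
    """Return one dict per combination of factor levels (full-factorial design)."""
    names = list(factors)
    return [dict(zip(names, combo)) for combo in product(*factors.values())]


grid = condition_grid(FACTORS)
print(len(grid), "reaction conditions to screen")
```

Each entry of `grid` describes one reaction to set up; the yield of PEGylated FSH measured
for each entry can then be compared to select the preferred conditions.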

Conjugate of the invention having a non-lysine residue as an attachment group

Based on the present disclosure the skilled person will be aware that amino
acid residues comprising other attachment groups may be introduced into and/or removed
from FSH-α and/or FSH-β, using the same approach as that illustrated above by lysine
residues. For instance, one or more amino acid residues comprising an acid group (glutamic
acid or aspartic acid), asparagine, tyrosine or cysteine may be introduced into positions which
in hFSH are occupied by amino acid residues having surface exposed side chains (i.e. the
positions mentioned above as being of interest for introduction of lysine residues), or removed.
As described above, introduction or removal of such amino acid residues is preferably
performed by substitution. Preferably, Asp is substituted by Asn, Glu by Gln, Tyr by Phe, and
Cys by Ser. Another possibility is introduction and/or removal of a histidine, e.g. by
substitution with arginine.

Non-polypeptide moiety of the conjugate of the invention

As indicated above, the non-polypeptide moiety of the conjugate of the invention
is preferably selected from the group consisting of a polymer molecule, a lipophilic
compound, an oligosaccharide moiety (by way of in vivo glycosylation) and an organic
derivatizing agent. All of these agents may confer desirable properties to the polypeptide part
of the conjugate, in particular an increased functional in vivo half-life and/or an increased
serum half-life. The polypeptide part of the conjugate is often conjugated to only one type of
non-polypeptide moiety, but may also be conjugated to two or more different types of
non-polypeptide moieties, e.g. to a polymer molecule and an oligosaccharide moiety, to a
lipophilic group and an oligosaccharide moiety, to an organic derivatizing agent and an
oligosaccharide moiety, to a lipophilic group and a polymer molecule, etc. The conjugation to
two or more different non-polypeptide moieties may be done simultaneously or sequentially.
In a preferred embodiment of a polypeptide conjugated to different types of non-polypeptide
moieties, the polypeptide is conjugated to one or more oligosaccharide moieties by in vivo
glycosylation, and to one or more polymer molecules, preferably PEG, more preferably at an
N-terminal, by conjugation in vitro.

Polypeptide of the invention 

In a further aspect the invention relates to a modified FSH-α or a modified
FSH-β polypeptide constituting part of a conjugate of the invention. The modified FSH-α and
FSH-β are preferably glycosylated and thus further comprise N-linked and/or O-linked
oligosaccharide moieties. Specific modified FSH-α and FSH-β polypeptides of the invention
are those described in the section entitled "Conjugate of the invention".

Methods of preparing a conjugate of the invention

In the following sections "Conjugation to an oligosaccharide moiety",
"Conjugation to a polymer molecule", "Conjugation to a lipophilic compound" and
"Conjugation to an organic derivatizing agent", conjugation to specific types of
non-polypeptide moieties is described.

Coupling to an oligosaccharide moiety

For in vivo glycosylation, conjugation to an oligosaccharide moiety takes place
by means of a glycosylating, eukaryotic expression host. The expression host cell may be
selected from fungal (filamentous fungal or yeast), insect or animal cells or from transgenic
plant cells. In one embodiment the host cell is a mammalian cell, such as a CHO cell (e.g.
CHO K1), a BHK cell or an HEK cell (e.g. HEK 293); an insect cell, such as an Sf9 cell; a yeast
cell, e.g. S. cerevisiae or Pichia pastoris; or any of the host cells mentioned hereinafter.
Preferred cells for expression of an in vivo glycosylated protein of the invention are mammalian
cells, in particular CHO cells.

Conjugation to a polymer molecule 

The polymer molecule to be coupled to the polypeptide may be any suitable
polymer molecule, such as a natural or synthetic homo-polymer or hetero-polymer, typically
with a molecular weight in the range of 300-50,000 Da, such as 500-20,000 Da, more
preferably in the range of 1000-15,000 Da, such as in the range of 1000-12,000 Da or
2000-10,000 Da. Examples of homo-polymers include a polyol (i.e. poly-OH), a polyamine
(i.e. poly-NH2) and a polycarboxylic acid (i.e. poly-COOH). A hetero-polymer is a polymer
which comprises different coupling groups, such as a hydroxyl group and an amine group.

Examples of suitable polymer molecules include polymer molecules selected
from the group consisting of polyalkylene oxide (PAO), including polyalkylene glycol (PAG),
such as polyethylene glycol (PEG) and polypropylene glycol (PPG), branched PEGs,
polyvinyl alcohol (PVA), poly-carboxylate, poly-(vinylpyrrolidone), polyethylene-co-maleic
acid anhydride, polystyrene-co-maleic acid anhydride, dextran, including
carboxymethyl-dextran, or any other biopolymer suitable for reducing immunogenicity and/or
increasing functional in vivo half-life and/or serum half-life. Another example of a polymer
molecule is human albumin or another abundant plasma protein. Generally, polyalkylene
glycol-derived polymers are biocompatible, non-toxic, non-antigenic, non-immunogenic, have
various water solubility properties, and are easily excreted from living organisms.

PEG is the preferred polymer molecule, since it has only a few reactive groups
capable of cross-linking compared to e.g. polysaccharides such as dextran. In particular,
monofunctional PEG, e.g. methoxypolyethylene glycol (mPEG), is of interest since its coupling
chemistry is relatively simple (only one reactive group is available for conjugating with
attachment groups on the polypeptide). Consequently, the risk of cross-linking is eliminated,
the resulting polypeptide conjugates are more homogeneous and the reaction of the polymer
molecules with the polypeptide is easier to control.

To effect covalent attachment of the polymer molecule(s) to the polypeptide,
the hydroxyl end groups of the polymer molecule must be provided in activated form, i.e. with
reactive functional groups. Suitable activated polymer molecules are commercially available,
e.g. from Shearwater Polymers, Inc., Huntsville, AL, USA, or from PolyMASC
Pharmaceuticals plc, UK. Alternatively, the polymer molecules can be activated by conventional
methods known in the art, e.g. as disclosed in WO 90/13540. Specific examples of activated
linear or branched polymer molecules for use in the present invention are described in the
Shearwater Polymers, Inc. 1997 and 2000 Catalogs (Functionalized Biocompatible Polymers
for Research and Pharmaceuticals, Polyethylene Glycol and Derivatives, incorporated herein
by reference). Specific examples of activated PEG polymers include the following linear
PEGs: NHS-PEG (e.g. SPA-PEG, SSPA-PEG, SBA-PEG, SS-PEG, SSA-PEG, SC-PEG, SG-PEG,
and SCM-PEG), NOR-PEG, BTC-PEG, EPOX-PEG, NCO-PEG, NPC-PEG, CDI-PEG, ALD-PEG,
TRES-PEG, VS-PEG, IODO-PEG, and MAL-PEG, and branched PEGs such as PEG2-NHS
and those disclosed in US 5,932,462 and US 5,643,575, both of which are incorporated
herein by reference. Furthermore, the following publications, incorporated herein by
reference, disclose useful polymer molecules and/or PEGylation chemistries: US 5,824,778, US
5,476,653, WO 97/32607, EP 229,108, EP 402,378, US 4,902,502, US 5,281,698, US
5,122,614, US 5,219,564, WO 92/16555, WO 94/04193, WO 94/14758, WO 94/17039, WO
94/18247, WO 94/28024, WO 95/00162, WO 95/11924, WO 95/13090, WO 95/33490, WO
96/00080, WO 97/18832, WO 98/41562, WO 98/48837, WO 99/32134, WO 99/32139, WO
99/32140, WO 96/40791, WO 98/32466, WO 95/06058, EP 439 508, WO 97/03106, WO
96/21469, WO 95/13312, EP 921 131, US 5,736,625, WO 98/05363, EP 809 996, US
5,629,384, WO 96/41813, WO 96/07670, US 5,473,034, US 5,516,673, EP 605 963, US
5,382,657, EP 510 356, EP 400 472, EP 183 503 and EP 154 316.

The conjugation of the polypeptide and the activated polymer molecules is conducted by use
of any conventional method, e.g. as described in the following references (which also describe
suitable methods for activation of polymer molecules): R.F. Taylor, (1991), "Protein
Immobilisation. Fundamentals and Applications", Marcel Dekker, N.Y.; S.S. Wong, (1992),
"Chemistry of Protein Conjugation and Crosslinking", CRC Press, Boca Raton; G.T. Hermanson
et al., (1993), "Immobilized Affinity Ligand Techniques", Academic Press, N.Y. The skilled
person will be aware that the activation method and/or conjugation chemistry to be used depends
on the attachment group(s) of the polypeptide (examples of which are given further above), as
well as the functional groups of the polymer (e.g. being amine, hydroxyl, carboxyl, aldehyde,
sulfhydryl, succinimidyl, maleimide, vinylsulfone or haloacetate). The PEGylation may be
directed towards conjugation to all available attachment groups on the polypeptide (i.e. such
attachment groups that are exposed at the surface of the polypeptide) or may be directed
towards one or more specific attachment groups, e.g. the N-terminal amino group (US
5,985,265). Furthermore, the conjugation may be achieved in one step or in a stepwise manner
(e.g. as described in WO 99/55377).

It will be understood that the PEGylation is designed so as to produce the
optimal molecule with respect to the number of PEG molecules attached, the size and form of
such molecules (e.g. whether they are linear or branched), and where in the polypeptide such
molecules are attached. The molecular weight of the polymer to be used will be chosen taking
into consideration the desired effect to be achieved. For instance, if the primary purpose of the
conjugation is to achieve a conjugate having a high molecular weight and larger size (e.g. to
reduce renal clearance), one may choose to conjugate either one or a few high molecular
weight polymer molecules or a number of polymer molecules with a smaller molecular weight
to obtain the desired effect. For epitope shielding, a sufficiently high number (e.g. 2-8, such as
3-6) of low molecular weight polymer molecules (e.g. with a molecular weight of about 5,000
Da) may be used to effectively shield all or most epitopes of the polypeptide.

When the protein is conjugated to only a single polymer molecule, for example
where an N-terminal PEG molecule is the only polymer molecule, it will often be
advantageous that the polymer molecule, which may be linear or branched, has a relatively high
molecular weight, e.g. about 12-20 kDa.

In a specific embodiment, the polypeptide conjugate of the invention comprises
a PEG molecule attached to most or substantially all of the lysine residues in the polypeptide
available for PEGylation, in particular a linear or branched PEG molecule, e.g. with a
molecular weight of about 5 kDa. In this case, it will normally be desirable to remove one or more
of the lysines present in wildtype hFSH-α or hFSH-β in order to provide a more limited number
of attachment sites and obtain a desired distribution of the PEG molecules. The polypeptide
conjugate may further comprise a PEG molecule attached to the N-terminal amino acid
residue in addition to the lysine residues.

Normally, the polymer conjugation is performed under conditions aiming at
reacting as many of the available polymer attachment groups as possible with polymer
molecules. This is achieved by means of a suitable molar excess of the polymer in relation to the
polypeptide. Typical molar ratios of activated polymer molecules to polypeptide are up to
about 1000:1, such as up to about 200:1 or up to about 100:1. In some cases, the ratio may be
somewhat lower, however, such as up to about 50:1, 10:1 or 5:1.
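
As a worked illustration of the molar excess discussed above, the following Python sketch
converts a chosen molar ratio of activated polymer to polypeptide into the mass of polymer to
weigh out. The function name and the example figures (1 mg of a polypeptide of 30,000 Da, a
5,000 Da PEG, a 100-fold molar excess) are hypothetical and serve only to show the arithmetic.

```python
def activated_polymer_mass_mg(polypeptide_mg, polypeptide_mw_da, polymer_mw_da, molar_ratio):
    """Mass of activated polymer (mg) giving the requested polymer:polypeptide molar ratio."""
    if min(polypeptide_mg, polypeptide_mw_da, polymer_mw_da, molar_ratio) <= 0:
        raise ValueError("all arguments must be positive")
    # mg divided by g/mol gives mmol; multiply by 1000 to obtain micromoles.
    polypeptide_umol = polypeptide_mg / polypeptide_mw_da * 1000.0
    polymer_umol = polypeptide_umol * molar_ratio
    # micromoles times g/mol gives micrograms; divide by 1000 to obtain mg.
    return polymer_umol * polymer_mw_da / 1000.0


# Hypothetical example: 1 mg polypeptide (30,000 Da), 5,000 Da PEG, 100-fold molar excess.
example_mg = activated_polymer_mass_mg(1.0, 30000.0, 5000.0, 100.0)
print(f"{example_mg:.2f} mg activated PEG")
```

The result scales linearly with the molar ratio, so the same function covers the whole range
of ratios mentioned above.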

It is also contemplated according to the invention to couple the polymer
molecules to the polypeptide through a linker. Suitable linkers are well known to the skilled
person. A preferred example is cyanuric chloride (Abuchowski et al., (1977), J. Biol. Chem., 252,
3578-3581; US 4,179,337; Shafer et al., (1986), J. Polym. Sci. Polym. Chem. 24, 375-378).

Subsequent to the conjugation, residual activated polymer molecules are
blocked according to methods known in the art, e.g. by addition of a primary amine to the
reaction mixture, and the resulting inactivated polymer molecules removed by a suitable method.

Covalent in vitro coupling of carbohydrate moieties (glycosides such as dextran)
to amino acid residues of the polypeptide may also be used, e.g. as described in WO
87/05330 and in Aplin et al., CRC Crit Rev. Biochem., pp. 259-306, 1981. The in vitro
coupling of carbohydrate moieties or PEG to protein- and peptide-bound Gln residues can be
carried out by transglutaminases (TGases). Transglutaminases catalyse the transfer of donor
amine groups to protein- and peptide-bound Gln residues in a so-called cross-linking reaction.
The donor amine groups can be protein- or peptide-bound, e.g. as the ε-amino group in Lys
residues, or can be part of a small or large organic molecule. An example of a small organic
molecule functioning as amino-donor in TGase-catalysed cross-linking is putrescine
(1,4-diaminobutane). An example of a larger organic molecule functioning as amino-donor in
TGase-catalysed cross-linking is an amine-containing PEG (Sato et al., Biochemistry 35,
13072-13080).

TGases, in general, are highly specific enzymes, and not every Gln residue
exposed on the surface of a protein is accessible to TGase-catalysed cross-linking to
amino-containing substances. On the contrary, only a few Gln residues function naturally as TGase
substrates, but the exact parameters governing which Gln residues are good TGase substrates
remain unknown. Thus, in order to render a protein susceptible to TGase-catalysed
cross-linking reactions it is often a prerequisite to add, at convenient positions, stretches of
amino acid sequence known to function very well as TGase substrates. Several amino acid
sequences are known to be or to contain excellent natural TGase substrates, e.g. substance P,
elafin, fibrinogen, fibronectin, α2-plasmin inhibitor, α-caseins, and β-caseins.

Conjugation to a lipophilic compound

The polypeptide and the lipophilic compound may be conjugated to each other
either directly or by use of a linker. The lipophilic compound may be a natural compound
such as a saturated or unsaturated fatty acid, a fatty acid diketone, a terpene, a prostaglandin, a
vitamin, a carotenoid or steroid, or a synthetic compound such as a carboxylic acid, an alcohol,
an amine or a sulphonic acid with one or more alkyl, aryl, alkenyl or other multiple unsaturated
compounds. The conjugation between the polypeptide and the lipophilic compound,
optionally through a linker, may be done according to methods known in the art, e.g. as described
by Bodanszky in Peptide Synthesis, John Wiley, New York, 1976 and in WO 96/12505.

Coupling to an organic derivatizing agent 

Covalent modification of the polypeptide exhibiting FSH activity may be
performed by reacting one or more attachment groups of the polypeptide with an organic
derivatizing agent. Suitable derivatizing agents and methods are well known in the art. For
example, cysteinyl residues most commonly are reacted with α-haloacetates (and corresponding
amines), such as chloroacetic acid or chloroacetamide, to give carboxymethyl or
carboxyamidomethyl derivatives. Cysteinyl residues also are derivatized by reaction with
bromotrifluoroacetone, α-bromo-β-(4-imidazoyl)propionic acid, chloroacetyl phosphate,
N-alkylmaleimides, 3-nitro-2-pyridyl disulfide, methyl 2-pyridyl disulfide,
p-chloromercuribenzoate, 2-chloromercuri-4-nitrophenol, or chloro-7-nitrobenzo-2-oxa-1,3-diazole.
Histidyl residues are derivatized by reaction with diethylpyrocarbonate at pH 5.5-7.0,
because this agent is relatively specific for the histidyl side chain. Para-bromophenacyl
bromide is also useful. The reaction is preferably performed in 0.1 M sodium cacodylate at pH
6.0. Lysinyl and amino terminal residues are reacted with succinic or other carboxylic acid
anhydrides. Derivatization with these agents has the effect of reversing the charge of the
lysinyl residues. Other suitable reagents for derivatizing α-amino-containing residues include
imidoesters such as methyl picolinimidate, pyridoxal phosphate, pyridoxal,
chloroborohydride, trinitrobenzenesulfonic acid, O-methylisourea, 2,4-pentanedione and
transaminase-catalyzed reaction with glyoxylate. Arginyl residues are modified by reaction with one or
several conventional reagents, among them phenylglyoxal, 2,3-butanedione,
1,2-cyclohexanedione, and ninhydrin. Derivatization of arginine residues requires that the reaction
be performed under alkaline conditions because of the high pKa of the guanidine functional
group.

Furthermore, these reagents may react with the groups of lysine as well as the
arginine guanidino group. Carboxyl side groups (aspartyl or glutamyl) are selectively
modified by reaction with carbodiimides (R-N=C=N-R'), where R and R' are different alkyl
groups, such as 1-cyclohexyl-3-(2-morpholinyl-4-ethyl) carbodiimide or
1-ethyl-3-(4-azonia-4,4-dimethylpentyl) carbodiimide. Furthermore, aspartyl and glutamyl residues
are converted to asparaginyl and glutaminyl residues by reaction with ammonium ions.



Blocking of a functional site

It has been reported that excessive polymer conjugation can lead to a loss of
activity of the polypeptide to which the polymer is conjugated. This problem can be eliminated
by e.g. removal of attachment groups located at the functional site or by blocking the
functional site prior to conjugation. The latter strategy constitutes a further embodiment of the
invention (the first strategy being exemplified further above, e.g. by removal of lysine
residues which may be located close to the functional site). More specifically, according to the
second strategy the conjugation between the polypeptide and the non-polypeptide moiety is
conducted under conditions where the functional site of the polypeptide is blocked by a helper
molecule capable of binding to the functional site of the polypeptide.

Preferably, the helper molecule is one which specifically recognizes a
functional site of the polypeptide, such as a receptor, in particular the FSH receptor or a part of
the FSH receptor. Alternatively, the helper molecule may be an antibody, in particular a
monoclonal antibody recognizing the polypeptide exhibiting FSH activity. In particular, the
helper molecule may be a neutralizing monoclonal antibody.

The polypeptide is allowed to interact with the helper molecule before effecting
conjugation. This ensures that the functional site of the polypeptide is shielded or protected
and consequently unavailable for derivatization by the non-polypeptide moiety such as a
polymer. Following its elution from the helper molecule, the conjugate between the
non-polypeptide moiety and the polypeptide can be recovered with at least a partially preserved
functional site.

The subsequent conjugation of the polypeptide having a blocked functional site
to a polymer, a lipophilic compound, an oligosaccharide moiety, an organic derivatizing agent
or any other compound is conducted in the normal way, e.g. as described in the sections above
entitled "Conjugation to ...".

Irrespective of the nature of the helper molecule to be used to shield the
functional site of the polypeptide from conjugation, it is desirable that the helper molecule is
free of or comprises only a few attachment groups for the non-polypeptide moiety of choice in
any parts of the molecule where the conjugation to such groups will hamper the desorption of
the conjugated polypeptide from the helper molecule. Hereby, selective conjugation to
attachment groups present in non-shielded parts of the polypeptide can be obtained and it is
possible to reuse the helper molecule for repeated cycles of conjugation. For instance, if the
non-polypeptide moiety is a polymer molecule such as PEG which has the epsilon amino group
of a lysine or N-terminal amino acid residue as an attachment group, it is desirable that the
helper molecule is substantially free of conjugatable epsilon amino groups, preferably free of
any epsilon amino groups. Accordingly, in a preferred embodiment the helper molecule is a
protein or peptide capable of binding to the functional site of the polypeptide, which protein
or peptide is free of any conjugatable attachment groups for the non-polypeptide moiety of
choice.

In a further embodiment the helper molecule is first covalently linked to a solid
phase such as column packing materials, for instance Sephadex or agarose beads, or a surface,
e.g. a reaction vessel. Subsequently, the polypeptide is loaded onto the column material
carrying the helper molecule and conjugation carried out according to methods known in the art,
e.g. as described in the sections above entitled "Conjugation to ...". This procedure allows
the polypeptide conjugate to be separated from the helper molecule by elution. The
polypeptide conjugate is eluted by conventional techniques under physico-chemical conditions
that do not lead to a substantial degradation of the polypeptide conjugate. The fluid phase
containing the polypeptide conjugate is separated from the solid phase to which the helper
molecule remains covalently linked. The separation can also be achieved in other ways. For
instance, the helper molecule may be derivatised with a second molecule (e.g. biotin) that can
be recognized by a specific binder (e.g. streptavidin). The specific binder may be linked to a
solid phase, thereby allowing the separation of the polypeptide conjugate from the helper
molecule-second molecule complex through passage over a second helper-solid phase column
which will retain, upon subsequent elution, the helper molecule-second molecule complex, but
not the polypeptide conjugate. The polypeptide conjugate may be released from the helper
molecule in any appropriate fashion. Deprotection may be achieved by providing conditions in
which the helper molecule dissociates from the functional site of the FSH to which it is
bound. For instance, a complex between an antibody to which a polymer is conjugated and an
anti-idiotypic antibody can be dissociated by adjusting the pH to an acid or alkaline pH.

Conjugation of a tagged polypeptide

In an alternative embodiment the polypeptide is expressed as a fusion protein 
with a tag, i.e. an amino acid sequence or peptide stretch made up of typically 1-30, such as 
1-20 amino acid residues. Besides allowing for fast and easy purification, the tag is a convenient
tool for achieving conjugation between the tagged polypeptide and the non-polypeptide
moiety. In particular, the tag may be used for achieving conjugation in microtiter plates or
other carriers, such as paramagnetic beads, to which the tagged polypeptide can be immobilised
via the tag. The conjugation to the tagged polypeptide in, e.g., microtiter plates has the
advantage that the tagged polypeptide can be immobilised in the microtiter plates directly
from the culture broth (in principle without any purification) and subjected to conjugation.
Thereby, the total number of process steps (from expression to conjugation) can be reduced.
Furthermore, the tag may function as a spacer molecule ensuring an improved accessibility to 
the immobilised polypeptide to be conjugated. The conjugation using a tagged polypeptide 
may be to any of the non-polypeptide moieties disclosed herein, e.g. to a polymer molecule 
such as PEG. 

The identity of the specific tag to be used is not critical as long as the tag is capable
of being expressed with the polypeptide and is capable of being immobilised on a suitable
surface or carrier material. A number of suitable tags are commercially available, e.g.
from Unizyme Laboratories, Denmark. For instance, the tag may consist of any of the following
sequences:

His-His-His-His-His-His
Met-Lys-His-His-His-His-His-His
Met-Lys-His-His-Ala-His-His-Gln-His-His
Met-Lys-His-Gln-His-Gln-His-Gln-His-Gln-His-Gln-His-Gln
Met-Lys-His-Gln-His-Gln-His-Gln-His-Gln-His-Gln-His-Gln-Gln

or any of the following:

EQKLISEEDL (a C-terminal tag described in Mol. Cell. Biol. 5:3610-16, 1985)
DYKDDDDK (a C- or N-terminal tag)
YPYDVPDYA

Antibodies against the above tags are commercially available, e.g. from ADI, 
Aves Lab and Research Diagnostics. 

The subsequent cleavage of the tag from the polypeptide may be achieved by
use of commercially available enzymes.
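
As an illustration of how the tags listed above might be handled in silico, the following minimal Python sketch converts a hyphen-separated three-letter tag into one-letter code and fuses it to a polypeptide sequence. The function names and the placeholder sequence are hypothetical and serve only as an example; the placeholder is not an FSH subunit sequence.

```python
# Three-letter to one-letter amino acid codes (standard abbreviations).
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def tag_to_one_letter(tag: str) -> str:
    """Convert a hyphen-separated three-letter tag into one-letter code."""
    return "".join(THREE_TO_ONE[residue] for residue in tag.split("-"))


def fuse_tag(polypeptide: str, tag: str, n_terminal: bool = True) -> str:
    """Return the tagged sequence; the tag is limited to 1-30 residues."""
    if not 1 <= len(tag) <= 30:
        raise ValueError("tag length outside the 1-30 residue range")
    return tag + polypeptide if n_terminal else polypeptide + tag


his6 = tag_to_one_letter("His-His-His-His-His-His")
mk_his6 = tag_to_one_letter("Met-Lys-His-His-His-His-His-His")

# PLACEHOLDER is a made-up sequence used only to show the fusion step.
PLACEHOLDER = "ACDEFGHIK"
c_tagged = fuse_tag(PLACEHOLDER, his6, n_terminal=False)
n_tagged = fuse_tag(PLACEHOLDER, mk_his6, n_terminal=True)
```

The 1-30 residue limit mirrors the typical tag length stated above; whether a given tag is placed N- or C-terminally is left to the caller.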

Methods for preparing a polypeptide of the invention or the polypeptide of the conjugate of 
the invention 

The polypeptide of the present invention or the polypeptide part of a conjugate
of the invention, optionally in glycosylated form, may be produced by any suitable method
known in the art. Such methods include constructing a nucleotide sequence encoding the
polypeptide and expressing the sequence in a suitable transformed or transfected host.
Polypeptides of the invention may also be produced, albeit less efficiently, by chemical synthesis
or a combination of chemical synthesis and recombinant DNA technology.

FSH-α and FSH-β are preferably expressed by the same host cell, thus becoming
dimerized in vivo prior to purification and possible in vitro conjugation to a
non-polypeptide moiety. Co-expression of FSH-α and FSH-β in CHO cells is e.g. described by
Keene et al., J Biol Chem 1989 25; 264(9): 4769-75. Alternatively, the polypeptide may be
expressed as a single-chain polypeptide wherein the nucleotide sequences encoding FSH-α
and FSH-β are fused, either directly or using a suitable peptide linker, and expressed as a
single-chain polypeptide using a similar approach to that described in US 5,883,073 or WO
96/05224. It will thus be understood that the polypeptide of the invention may comprise the
FSH-α and FSH-β subunits in the form of two separate polypeptide chains, where the two
chains become dimerized in vivo so as to form a dimeric polypeptide, or it may comprise a
single-chain construct comprising the two subunits covalently linked by a peptide bond or a
peptide linker.

In an alternative embodiment, two FSH-β subunits, wherein at least one of the
two β subunits is modified as described herein, preferably by introduction of at least one N- or
O-glycosylation site, may be expressed as a single-chain polypeptide in which the subunits are
either fused directly or via a peptide linker. Similarly, two FSH-α subunits, wherein at least
one of the two α subunits is modified as described herein, may also be expressed as a
single-chain polypeptide with the subunits fused directly or via a peptide linker. Further, it is also
possible to produce single-chain constructs comprising more than two subunits, e.g. three
subunits, wherein at least one of the individual subunits is modified as described herein, and
wherein the subunits are fused to each other directly or via a peptide linker. For example, a
single-chain construct having the sequence FSHα-FSHβ-FSHβ, FSHβ-FSHα-FSHβ or
FSHβ-FSHβ-FSHα, wherein the β subunits in each construct are identical or different, may be
produced using techniques known in the art. Single-chain constructs of this general type are
disclosed in US 5,705,478, US 5,883,073, WO 99/25489 and WO 96/05224.

For single-chain constructs, the linker peptide will often predominantly include 
the amino acid residues Gly, Ser, Ala and/or Thr. Such a linker typically comprises 1-30
amino acid residues, such as a sequence of about 2-20 or 3-15 amino acid residues. The amino
acid residues selected for inclusion in the linker peptide should exhibit properties that do not
interfere significantly with the activity of the polypeptide. Thus, the linker peptide should on
the whole not exhibit a charge which would be inconsistent with the desired FSH activity, or
interfere with internal folding, or form bonds or other interactions with amino acid residues in
one or more of the subunits which would seriously impede the binding of the dimeric or
multimeric polypeptide to the receptor.

Specific linkers for use in the present invention may be designed on the basis of
known naturally occurring as well as artificial polypeptide linkers (see, e.g., Hallewell et al.
(1989), J. Biol. Chem. 264, 5260-5268; Alfthan et al. (1995), Protein Eng. 8, 725-731;
Robinson & Sauer (1996), Biochemistry 35, 109-116; Khandekar et al. (1997), J. Biol. Chem.
272, 32190-32197; Fares et al. (1998), Endocrinology 139, 2459-2464; Smallshaw et al.
(1999), Protein Eng. 12, 623-630; US 5,856,456). For instance, linkers used for creating
single-chain antibodies, e.g. a 15mer consisting of three repeats of a Gly-Gly-Gly-Gly-Ser amino
acid sequence ((Gly4Ser)3), are contemplated to be useful. Furthermore, phage display
technology as well as selective infective phage technology can be used to diversify and select
appropriate linker sequences (Tang et al., J. Biol. Chem. 271, 15682-15686, 1996; Hennecke et
al. (1998), Protein Eng. 11, 405-410). Also, Arc repressor phage display has been used to
optimize the linker length and composition for increased stability of a single-chain protein
(Robinson and Sauer (1998), Proc. Natl. Acad. Sci. USA 95, 5929-5934). Another way of
obtaining a suitable linker is by optimizing a simple linker, e.g. (Gly4Ser)n, through random
mutagenesis. The linker may e.g. be (Gly4Ser)n or (Gly3Ser)n where n is 1, 2, 3 or 4.
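
The linker patterns and single-chain arrangements described above can be sketched in a few lines of Python. This is only an illustrative sketch: the function names are hypothetical, the subunit strings are made-up placeholders rather than FSH sequences, and the composition check is stricter than the text, which only says that linkers will often predominantly include Gly, Ser, Ala and/or Thr.

```python
def make_linker(n_gly: int, n_repeats: int) -> str:
    """Build (Gly_k Ser)_n in one-letter code, e.g. (Gly4Ser)3 for k=4, n=3."""
    if n_repeats not in (1, 2, 3, 4):
        raise ValueError("n is 1, 2, 3 or 4 for the linkers named in the text")
    return ("G" * n_gly + "S") * n_repeats


def check_linker(linker: str) -> bool:
    """True if the linker has 1-30 residues drawn only from Gly, Ser, Ala, Thr.

    Assumption: 'only' is a simplification of 'predominantly' in the text.
    """
    return 1 <= len(linker) <= 30 and set(linker) <= set("GSAT")


def single_chain(subunits, linker: str = "") -> str:
    """Join subunits directly (empty linker) or via a peptide linker."""
    return linker.join(subunits)


# Placeholder strings standing in for the alpha and beta subunit sequences.
ALPHA = "AAAA"
BETA = "CCCC"

gly4ser_3 = make_linker(4, 3)   # the 15mer used for single-chain antibodies
gly3ser_4 = make_linker(3, 4)

# An alpha-beta-beta arrangement with a linker between each pair of subunits.
construct = single_chain([ALPHA, BETA, BETA], gly4ser_3)
# The same arrangement with the subunits fused directly.
direct = single_chain([ALPHA, BETA, BETA])
```

Every linker produced by `make_linker` with 3 or 4 glycines falls within the 1-30 residue range given above, the longest being (Gly4Ser)4 with 20 residues.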

The nucleotide sequence encoding FSH-α or FSH-β modified according to the
invention may be constructed by isolating or synthesizing a nucleotide sequence encoding the
parent FSH subunit, such as hFSH-α or hFSH-β with the amino acid sequence shown in SEQ
ID NO:2 or 4, respectively, or the precursor form thereof (shown in SEQ ID NO:1 and 3,
respectively) and then changing the nucleotide sequence so as to effect introduction (i.e.
insertion or substitution) or deletion (i.e. removal or substitution) of the relevant amino acid
residue(s). The nucleotide sequence is conveniently modified by site-directed mutagenesis in
accordance with conventional methods. Alternatively, the nucleotide sequence may be prepared
by chemical synthesis, e.g. by using an oligonucleotide synthesizer, wherein oligonucleotides
are designed based on the amino acid sequence of the desired polypeptide, and preferably
selecting those codons that are favored in the host cell in which the recombinant polypeptide
will be produced. For example, several small oligonucleotides coding for portions of the
desired polypeptide may be synthesized and assembled by PCR, ligation or ligation chain
reaction (LCR) (Barany, PNAS 88:189-193, 1991). The individual oligonucleotides typically
contain 5' or 3' overhangs for complementary assembly.
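
The design of oligonucleotides from an amino acid sequence, as outlined above, might be sketched as follows. This is a simplified illustration under stated assumptions: the one-codon-per-residue table is a hypothetical choice made for the example (real codon preferences depend on the host cell), all fragments are written on the same strand, and the oligonucleotide length and overlap are arbitrary.

```python
# Hypothetical "preferred codon" table, one codon per amino acid.
# Each entry is a valid codon for its residue, but the choice is illustrative.
PREFERRED_CODON = {
    "A": "GCC", "R": "CGG", "N": "AAC", "D": "GAC", "C": "TGC",
    "Q": "CAG", "E": "GAG", "G": "GGC", "H": "CAC", "I": "ATC",
    "L": "CTG", "K": "AAG", "M": "ATG", "F": "TTC", "P": "CCC",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAC", "V": "GTG",
}


def back_translate(protein: str) -> str:
    """Encode a one-letter protein sequence using the preferred codons."""
    return "".join(PREFERRED_CODON[aa] for aa in protein)


def split_into_oligos(dna: str, length: int, overlap: int) -> list:
    """Cut dna into fragments of at most `length` nucleotides.

    Consecutive fragments share `overlap` nucleotides, which stand in for
    the complementary overhangs used during assembly.
    """
    if not 0 < overlap < length:
        raise ValueError("overlap must be positive and shorter than length")
    step = length - overlap
    oligos = []
    start = 0
    while True:
        oligos.append(dna[start:start + length])
        if start + length >= len(dna):
            break
        start += step
    return oligos


def reassemble(oligos: list, overlap: int) -> str:
    """Rebuild the full sequence by dropping each repeated overlap."""
    return oligos[0] + "".join(o[overlap:] for o in oligos[1:])


# Example: the EQKLISEEDL tag sequence mentioned earlier in the text.
dna = back_translate("EQKLISEEDL")
oligos = split_into_oligos(dna, length=12, overlap=4)
```

In practice the fragments would alternate between the two strands so that the overlaps can anneal; the sketch keeps a single strand to make the overlap bookkeeping easy to follow.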

Once assembled (by synthesis, site-directed mutagenesis or another method),
the nucleotide sequence encoding the polypeptide is inserted into a recombinant vector and
operably linked to control sequences necessary for expression of the FSH in the desired
transformed host cell.

It should of course be understood that not all vectors and expression control
sequences function equally well to express the nucleotide sequence encoding a polypeptide
described herein. Neither will all hosts function equally well with the same expression system.
However, one of skill in the art may make a selection among these vectors, expression control
sequences and hosts without undue experimentation. For example, in selecting a vector, the
host must be considered because the vector must replicate in it or be able to integrate into the
chromosome. The vector's copy number, the ability to control that copy number, and the
expression of any other proteins encoded by the vector, such as antibiotic markers, should also
be considered. In selecting an expression control sequence, a variety of factors should also be
considered. These include, for example, the relative strength of the sequence, its
controllability, and its compatibility with the nucleotide sequence encoding the polypeptide,
particularly as regards potential secondary structures. Hosts should be selected by consideration
of their compatibility with the chosen vector, the toxicity of the product coded for by the
nucleotide sequence, their secretion characteristics, their ability to fold the polypeptide
correctly, their fermentation or culture requirements, and the ease of purification of the
products coded for by the nucleotide sequence.

The recombinant vector may be an autonomously replicating vector, i.e. a vector
which exists as an extrachromosomal entity, the replication of which is independent of
chromosomal replication, e.g. a plasmid. Alternatively, the vector is one which, when
introduced into a host cell, is integrated into the host cell genome and replicated together with the
chromosome(s) into which it has been integrated.

The vector is preferably an expression vector in which the nucleotide sequence
encoding the polypeptide of the invention is operably linked to additional segments required
for transcription of the nucleotide sequence. The vector is typically derived from plasmid or
viral DNA. A number of suitable expression vectors for expression in the host cells mentioned
herein are commercially available or described in the literature. Useful expression vectors for
mammalian eukaryotic hosts include, for example, vectors comprising expression control
sequences from SV40, bovine papilloma virus, adenovirus and cytomegalovirus. Specific
vectors are, e.g., pCDNA3.1(+)/Hyg (Invitrogen, Carlsbad, CA, USA) and pCI-neo (Stratagene,
La Jolla, CA, USA). Useful expression vectors for yeast cells include the 2μ plasmid and
derivatives thereof, the POT1 vector (US 4,931,373), the pJS037 vector described in Okkels,
Ann. New York Acad. Sci. 782, 202-207, 1996, and pPICZ A, B or C (Invitrogen). Useful
vectors for insect cells include pVL941, pBG311 (Cate et al., Cell 45, pp. 685-98 (1986)),
pBluebac 4.5 and pMelbac (both available from Invitrogen). Useful expression vectors for bacterial
hosts include known bacterial plasmids, such as plasmids from E. coli, including pBR322,
pET3a and pET12a (both from Novagen Inc., WI, USA), wider host range plasmids, such as
RP4, phage DNAs, e.g., the numerous derivatives of phage lambda, e.g., NM989, and other
DNA phages, such as M13 and filamentous single stranded DNA phages.

Other vectors for use in this invention include those that allow the nucleotide
sequence encoding the polypeptide to be amplified in copy number. Such amplifiable vectors
are well known in the art. They include, for example, vectors able to be amplified by DHFR
amplification (see, e.g., Kaufman, U.S. Pat. No. 4,470,461, Kaufman and Sharp, Mol. Cell.
Biol. 2, pp. 1304-19 (1982)) and glutamine synthetase ("GS") amplification (see, e.g., US
5,122,464 and EP 338,841).

In one embodiment, a pair of expression vectors is used for expressing the
polypeptide subunits of the invention. Each of the vectors of said pair is capable of
transfecting a eukaryotic cell as described herein, and the vectors comprise nucleotide sequences
encoding, respectively, a modified FSH-α as described herein and a wildtype FSH-β subunit, a
modified FSH-β as described herein and a wildtype FSH-α subunit, or a modified FSH-α and
a modified FSH-β as described herein. The use of a pair of vectors is e.g. described in EP
211,894. Alternatively, a single expression vector comprising nucleotide sequences encoding
both the FSH-α subunit and the FSH-β subunit, where at least one of the subunits is modified
as described herein, may be used for expressing the polypeptide subunits.

The recombinant vector may further comprise a DNA sequence enabling the
vector to replicate in the host cell in question. An example of such a sequence (when the host
cell is a mammalian cell) is the SV40 origin of replication. When the host cell is a yeast cell,
suitable sequences enabling the vector to replicate are the yeast plasmid 2μ replication genes
REP 1-3 and origin of replication.

The vector may also comprise a selectable marker, e.g. a gene whose product
complements a defect in the host cell, such as the gene coding for dihydrofolate reductase
(DHFR) or the Schizosaccharomyces pombe TPI gene (described by P.R. Russell, Gene 40,
1985, pp. 125-130), or one which confers resistance to a drug, e.g. ampicillin, kanamycin,
tetracycline, chloramphenicol, neomycin, hygromycin or methotrexate. For Saccharomyces
cerevisiae, selectable markers include ura3 and leu2. For filamentous fungi, selectable
markers include amdS, pyrG, argB, niaD and sC.

The term "control sequences" is defined herein to include all components
which are necessary or advantageous for the expression of the polypeptide of the invention.
Each control sequence may be native or foreign to the nucleic acid sequence encoding the
polypeptide. Such control sequences include, but are not limited to, a leader sequence,
polyadenylation sequence, propeptide sequence, promoter, enhancer or upstream activating
sequence, signal peptide sequence, and transcription terminator. At a minimum, the control
sequences include a promoter.

A wide variety of expression control sequences may be used in the present
invention. Such useful expression control sequences include the expression control sequences
associated with structural genes of the foregoing expression vectors as well as any sequence
known to control the expression of genes of prokaryotic or eukaryotic cells or their viruses,
and various combinations thereof.

Examples of suitable control sequences for directing transcription in mammalian
cells include the early and late promoters of SV40 and adenovirus, e.g. the adenovirus 2
major late promoter, the MT-1 (metallothionein gene) promoter, the human cytomegalovirus
immediate-early gene promoter (CMV), the human elongation factor 1α (EF-1α) promoter,
the Drosophila minimal heat shock protein 70 promoter, the Rous Sarcoma Virus (RSV)
promoter, the human ubiquitin C (UbC) promoter, the human growth hormone terminator, SV40
or adenovirus E1b region polyadenylation signals and the Kozak consensus sequence (Kozak,
M. J Mol Biol 1987 Aug 20;196(4):947-50).

In order to improve expression in mammalian cells a synthetic intron may be
inserted in the 5' untranslated region of the nucleotide sequence encoding the polypeptide. An
example of a synthetic intron is the synthetic intron from the plasmid pCI-Neo (available from
Promega Corporation, WI, USA).

Examples of suitable control sequences for directing transcription in insect
cells include the polyhedrin promoter, the P10 promoter, the Autographa californica
polyhedrosis virus basic protein promoter, the baculovirus immediate early gene 1 promoter and the
baculovirus 39K delayed-early gene promoter, and the SV40 polyadenylation sequence.
Examples of suitable control sequences for use in yeast host cells include the promoters of the
yeast α-mating system, the yeast triose phosphate isomerase (TPI) promoter, promoters from
yeast glycolytic genes or alcohol dehydrogenase genes, the ADH2-4c promoter, and the
inducible GAL promoter. Examples of suitable control sequences for use in filamentous fungal
host cells include the ADH3 promoter and terminator, a promoter derived from the genes
encoding Aspergillus oryzae TAKA amylase, triose phosphate isomerase or alkaline protease, an
A. niger α-amylase, A. niger or A. nidulans glucoamylase, A. nidulans acetamidase,
Rhizomucor miehei aspartic proteinase or lipase, the TPI1 terminator and the ADH3 terminator.
Examples of suitable control sequences for use in bacterial host cells include promoters of the lac
system, the trp system, the TAC or TRC system, and the major promoter regions of phage
lambda.

The presence or absence of a signal peptide will, e.g., depend on the expression
host cell used for the production of the polypeptide to be expressed (whether it is an
intracellular or extracellular polypeptide) and whether it is desirable to obtain secretion. For use in
filamentous fungi, the signal peptide may conveniently be derived from a gene encoding an
Aspergillus sp. amylase or glucoamylase, a gene encoding a Rhizomucor miehei lipase or
protease or a Humicola lanuginosa lipase. The signal peptide is preferably derived from a gene
encoding A. oryzae TAKA amylase, A. niger neutral α-amylase, A. niger acid-stable amylase,
or A. niger glucoamylase. For use in insect cells, the signal peptide may conveniently be
derived from an insect gene (cf. WO 90/05783), such as the Lepidopteran Manduca sexta
adipokinetic hormone precursor (cf. US 5,023,328), the honeybee melittin (Invitrogen),
ecdysteroid UDPglucosyltransferase (egt) (Murphy et al., Protein Expression and Purification 4,




349-357 (1993)) or human pancreatic lipase (hpl) (Methods in Enzymology 284, pp. 262-272,
1997). A preferred signal peptide for use in mammalian cells is that of hFSH or the murine Ig
kappa light chain signal peptide (Coloma, M (1992) J. Imm. Methods 152:89-104). For use in
yeast cells suitable signal peptides have been found to be the α-factor signal peptide from S.
cerevisiae (cf. US 4,870,008), a modified carboxypeptidase signal peptide (cf. L.A. Valls et
al., Cell 48, 1987, pp. 887-897), the yeast BAR1 signal peptide (cf. WO 87/02670), the yeast
aspartic protease 3 (YAP3) signal peptide (cf. M. Egel-Mitani et al., Yeast 6, 1990, pp.
127-137), and the synthetic leader sequence TA57 (WO 98/32867). For use in E. coli cells a
suitable signal peptide has been found to be the signal peptide ompA (EP 581821).

The nucleotide sequences of the invention encoding the dimeric polypeptide
exhibiting FSH activity, whether prepared by site-directed mutagenesis, synthesis, PCR or
other methods, may optionally also include a nucleotide sequence that encodes a signal
peptide. The signal peptide is present when the polypeptide is to be secreted from the cells in
which it is expressed. Such signal peptide, if present, should be one recognized by the cell
chosen for expression of the polypeptide. The signal peptide may be homologous (e.g. be that
normally associated with a hFSH subunit) or heterologous (i.e. originating from another
source than hFSH) to the polypeptide or may be homologous or heterologous to the host cell,
i.e. be a signal peptide normally expressed from the host cell or one which is not normally
expressed from the host cell. Accordingly, the signal peptide may be prokaryotic, e.g. derived
from a bacterium such as E. coli, or eukaryotic, e.g. derived from a mammalian, or insect or
yeast cell.

Any suitable host may be used to produce the polypeptide subunits of the
invention, including bacteria, fungi (including yeasts), plant, insect, mammal, or other
appropriate animal cells or cell lines, as well as transgenic animals or plants. Examples of bacterial
host cells include gram-positive bacteria such as strains of Bacillus, e.g. B. brevis or B.
subtilis, or Streptomyces, or gram-negative bacteria, such as Pseudomonas or strains of E. coli.
The introduction of a vector into a bacterial host cell may, for instance, be effected by
protoplast transformation (see, e.g., Chang and Cohen, 1979, Molecular General Genetics 168:
111-115), using competent cells (see, e.g., Young and Spizizen, 1961, Journal of Bacteriology
81: 823-829, or Dubnau and Davidoff-Abelson, 1971, Journal of Molecular Biology 56:
209-221), electroporation (see, e.g., Shigekawa and Dower, 1988, Biotechniques 6: 742-751), or
conjugation (see, e.g., Koehler and Thorne, 1987, Journal of Bacteriology 169: 5271-5278).
Examples of suitable filamentous fungal host cells include strains of Aspergillus, e.g. A.
oryzae, A. niger, or A. nidulans, Fusarium or Trichoderma. Fungal cells may be transformed
by a process involving protoplast formation, transformation of the protoplasts, and
regeneration of the cell wall in a manner known per se. Suitable procedures for transformation of
Aspergillus host cells are described in EP 238 023 and US 5,679,543. Suitable methods for
transforming Fusarium species are described by Malardier et al., 1989, Gene 78: 147-156 and
WO 96/00787. Examples of suitable yeast host cells include strains of Saccharomyces, e.g. S.
cerevisiae, Schizosaccharomyces, Kluyveromyces, Pichia, such as P. pastoris or P.
methanolica, Hansenula, such as H. polymorpha, or Yarrowia. Yeast may be transformed using the
procedures described by Becker and Guarente, In Abelson, J.N. and Simon, M.I., editors,
Guide to Yeast Genetics and Molecular Biology, Methods in Enzymology, Volume 194, pp
182-187, Academic Press, Inc., New York; Ito et al, 1983, Journal of Bacteriology 153: 163;
Hinnen et al, 1978, PNAS USA 75: 1920; and as disclosed by Clontech Laboratories, Inc,
Palo Alto, CA, USA (in the product protocol for the Yeastmaker™ Yeast Transformation
System Kit). Examples of suitable insect host cells include a Lepidoptera cell line, such as
Spodoptera frugiperda (Sf9 or Sf21) or Trichoplusia ni cells (High Five) (US 5,077,214).
Transformation of insect cells and production of heterologous polypeptides therein may be
performed as described by Invitrogen. Examples of suitable mammalian host cells include
Chinese hamster ovary (CHO) cell lines (e.g. CHO-K1; ATCC CCL-61), Green Monkey cell
lines (COS) (e.g. COS 1 (ATCC CRL-1650), COS 7 (ATCC CRL-1651)), mouse cells (e.g.
NS/0), Baby Hamster Kidney (BHK) cell lines (e.g. ATCC CRL-1632 or ATCC CCL-10),
and human cells (e.g. HEK 293 (ATCC CRL-1573)), as well as plant cells in tissue culture.
Additional suitable cell lines are known in the art and available from public depositories such
as the American Type Culture Collection, USA. Methods for introducing exogenous DNA
into mammalian host cells include calcium phosphate-mediated transfection, electroporation,
DEAE-dextran mediated transfection, liposome-mediated transfection, viral vectors and the
transfection method described by Life Technologies Ltd, Paisley, UK using Lipofectamine
2000. These methods are well known in the art and e.g. described by Ausubel et al. (eds.),
1996, Current Protocols in Molecular Biology, John Wiley & Sons, NY, USA. The cultivation
of mammalian cells is conducted according to established methods, e.g. as disclosed in
Animal Cell Biotechnology, Methods and Protocols, edited by Nigel Jenkins, 1999, Humana
Press Inc, Totowa, NJ, USA, and Harrison MA and Rae IF, General Techniques of Cell
Culture, Cambridge University Press 1997.

In the production methods of the present invention, the cells are cultivated in a
nutrient medium suitable for production of the polypeptide using methods known in the art.
For example, the cells may be cultivated by shake flask cultivation, small-scale or large-scale
fermentation (including continuous, batch, fed-batch, or solid state fermentations) in
laboratory or industrial fermenters performed in a suitable medium and under conditions allowing
the polypeptide to be expressed and/or isolated. The cultivation takes place in a suitable
nutrient medium comprising carbon and nitrogen sources and inorganic salts, using procedures
known in the art. Suitable media are available from commercial suppliers or may be prepared
according to published compositions (e.g. in catalogues of the American Type Culture
Collection). If the polypeptide is secreted into the nutrient medium, it can be recovered directly
from the medium. If the polypeptide is not secreted, it can be recovered from cell lysates.

The resulting polypeptide may be recovered by methods known in the art. For
example, it may be recovered from the nutrient medium by conventional procedures
including, but not limited to, centrifugation, filtration, extraction, spray drying, evaporation, or
precipitation.

The polypeptides may be purified by a variety of procedures known in the art
including, but not limited to, chromatography (e.g. ion exchange, affinity, hydrophobic,
chromatofocusing, and size exclusion), electrophoretic procedures (e.g. preparative isoelectric
focusing), differential solubility (e.g. ammonium sulfate precipitation), SDS-PAGE, or
extraction (see e.g. Protein Purification, J.-C. Janson and Lars Ryden, editors, VCH Publishers,
New York, 1989).



Pharmaceutical composition of the invention and its use 

In one aspect the polypeptide, the conjugate or the pharmaceutical composition
according to the invention is used for the manufacture of a medicament for treatment of
infertility or diseases associated with insufficient endogenous production of FSH.

In another aspect the polypeptide, the conjugate or the pharmaceutical
composition according to the invention is used in a method of treating an infertile mammal, in
particular a human, comprising administering to the mammal in need thereof such polypeptide,
conjugate or pharmaceutical composition.

The polypeptide exhibiting FSH activity of the invention or the conjugate of
the invention is administered at a dose approximately paralleling that employed in therapy
with rhFSH such as Gonal-F® and Puregon®. However, due to the increased functional in
vivo half-life of the conjugate of the invention, it is contemplated that the product will be
administered less frequently and at a dose which provides a comparable effect to that obtained in
current therapy. It is thus contemplated that the composition of the invention may be
administered at substantially less frequent intervals than currently available treatments, e.g. not more
often than once every three days, such as not more than once every four, five, six or seven
days. Accordingly, the exact dose to be administered will depend on the circumstances,
including the patient to be treated, the cause of infertility if known, the status of the ovaries, the
patient's plasma FSH concentration prior to treatment, and the functional in vivo half-life of
the product. Normally, in the treatment of infertility the dose should be capable of stimulating
follicle maturation, e.g. induce follicles to grow about 2 mm per day during a time period of
8-9 days. For instance, for a product having a functional in vivo half-life of 3-4 days, two
doses should be given at least three days apart if a relatively stable plasma concentration is
desired. Analogously, for a product having a functional in vivo half-life of about 6 days, one
dose would suffice during most of the stimulation period.
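
The relationship between half-life and dosing interval described above can be illustrated with a small numerical sketch. It assumes simple first-order elimination with instantaneous absorption, which the text does not specify, and it uses arbitrary concentration units; the function names are hypothetical and the figures are not dosing guidance.

```python
def remaining_fraction(t_days: float, half_life_days: float) -> float:
    """Fraction of a dose remaining after t_days under first-order decay."""
    return 0.5 ** (t_days / half_life_days)


def plasma_level(t_days, dose_times, half_life_days, dose=1.0):
    """Sum the contributions of all doses given at or before t_days.

    Assumption: doses add linearly and each is absorbed instantaneously.
    """
    return sum(
        dose * remaining_fraction(t_days - t0, half_life_days)
        for t0 in dose_times
        if t0 <= t_days
    )


# One dose of a product with a half-life of about 6 days.
single_day6 = plasma_level(6, [0], half_life_days=6.0)        # 0.5
single_day9 = plasma_level(9, [0], half_life_days=6.0)

# Two doses three days apart for a product with a half-life of 3 days.
double_day3 = plasma_level(3, [0, 3], half_life_days=3.0)     # 1.5
double_day6 = plasma_level(6, [0, 3], half_life_days=3.0)     # 0.75
```

Under these assumptions a single dose of the 6-day product still retains half of its initial level after six days, whereas the 3-day product needs the second dose to stay in a comparable range over the same period.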

The composition of the invention may be exceedingly advantageous when
employed in a step-down protocol, i.e. a protocol where decreasing dosages of FSH are given
during the stimulation period, but where use of the composition of the invention, e.g.
administered in one or two doses as outlined above, may provide such a slowly decreasing plasma
concentration of FSH.

It will be apparent to those of skill in the art that an effective amount of a
conjugate, preparation or composition of the invention depends, inter alia, upon the disease, the
dose, the administration schedule, whether the polypeptide or conjugate or composition is
administered alone or in conjunction with other therapeutic agents, the serum half-life of the
compositions, and the general health of the patient. Typically, an effective dose of the
conjugate, preparation or composition of the invention is sufficient to ensure development and
maturation of follicles at a rate and to a degree compatible with that obtained using standard
rhFSH such as Gonal-F® and Puregon®.

A further contemplated advantage is that the more stable plasma concentration
obtained with a composition of the invention results in a more efficient development and
maturation of follicles, which subsequently may enable a higher pregnancy rate.

The polypeptide or conjugate of the invention is normally administered in a
composition including one or more pharmaceutically acceptable carriers or excipients.
"Pharmaceutically acceptable" means a carrier or excipient that does not cause any untoward effects
in patients to whom it is administered. Such pharmaceutically acceptable carriers and
excipients are well known in the art, and the polypeptide or conjugate of the invention can be
formulated into pharmaceutical compositions by well-known methods (see e.g. Remington's
Pharmaceutical Sciences, 18th edition, A. R. Gennaro, Ed., Mack Publishing Company
(1990); Pharmaceutical Formulation Development of Peptides and Proteins, S. Frokjaer and
L. Hovgaard, Eds., Taylor & Francis (2000); and Handbook of Pharmaceutical Excipients, 3rd
edition, A. Kibbe, Ed., Pharmaceutical Press (2000)). Pharmaceutically acceptable excipients
that may be used in compositions comprising the polypeptide or conjugate of the invention
include, for example, buffering agents, stabilizing agents, preservatives, isotonifiers,
non-ionic surfactants or detergents ("wetting agents"), antioxidants, bulking agents or fillers,
chelating agents and cosolvents.

The pharmaceutical composition of the polypeptide or conjugate of the
invention may be formulated in a variety of forms, including liquids, e.g. ready-to-use solutions or
suspensions, gels, lyophilized forms, or any other suitable form, e.g. powder or crystals
suitable for preparing a solution. The preferred form will depend upon the particular indication
being treated and will be apparent to one of skill in the art.

The pharmaceutical composition containing the polypeptide or conjugate of the
invention may be administered intravenously, intramuscularly, intraperitoneally,
intradermally, subcutaneously, sublingually, buccally, intranasally, transdermally, by
inhalation, or in any other acceptable manner, e.g. using PowderJect® or ProLease®
technology or a pen injection system. The preferred mode of administration will depend upon
the particular indication being treated and will be apparent to one of skill in the art. In
particular, it is advantageous that the composition be administered subcutaneously, since this
allows the patient to conduct the administration herself.

The pharmaceutical composition of the invention may be administered in
conjunction with other therapeutic agents. These agents may be incorporated as part of the same
pharmaceutical composition or may be administered separately from the polypeptide or
conjugate of the invention, either concurrently or in accordance with any other acceptable
treatment schedule. In addition, the polypeptide, conjugate or pharmaceutical composition of the
invention may be used as an adjunct to other therapies.

By obtaining a more stable FSH plasma concentration just above the threshold
level for follicle growth, the composition of the invention is of particular interest for the
treatment of women suffering from anovulation WHO type I, II or III, since only 1-2 mature
follicles are desired in these patients.

Furthermore, the invention relates in other aspects to the use of a composition
of the invention in a step-down protocol where a decreasing plasma FSH concentration is
obtained using only one or two injections, and preferably only a single injection, to the use of a
composition of the invention in a step-up protocol where an increase in FSH concentrations is
obtained faster using a lower individual as well as total dosage, and to the use of a
composition of the invention in combination with compounds for in vitro maturation (sterol
derivatives such as FF-MAS and media containing growth and maturation factors known in the art).

Mixtures of FSH and LH activities (hMG) are routinely used in the treatment
of human infertility. This particular combination therapy may be advantageous because
gonadal support of gamete maturation is dependent upon the synergistic actions of both FSH and
LH. Current treatment protocols requiring FSH and LH activity utilize urinary extracts from
postmenopausal women. The use of these extracts is compromised by several factors,
including variability.

It will in some cases be advantageous to administer the composition of the
invention as part of a treatment protocol that also involves LH and/or hCG, for example
recombinant LH and/or hCG. This may in particular be useful for treatment of women with low
endogenous LH levels. Finally, the composition of the invention may be used, possibly in
combination with LH, in the treatment of male infertility, in particular of hypogonadotrophic
hypogonadism and oligo- or azoospermia. The more stable plasma concentration obtained with a
composition of the invention may lead to a more efficient spermatogenesis. Also, a
long-lasting effect would be particularly advantageous for such treatment due to the long-term
treatment period of about three months.

The present invention will be further illustrated by the following non-limiting
methods and examples.

Structure analysis methods 

Sequence numbering 

The amino acid sequence of hFSH-α is numbered according to the mature
sequence shown in SEQ ID NO:2; an (a) suffix herein indicates the α chain. The amino acid
sequence of hFSH-β is numbered according to the mature sequence shown in SEQ ID NO:4; a
(b) suffix herein indicates the β chain.

Structures 

Human FSH-α is identical to the α chain of human chorionic gonadotropin
(hCG), for which two published structures are available: Wu, H., Lustbader, J. W., Liu, Y.,
Canfield, R. E., Hendrickson, W. A.: Structure 2 pp. 545 (1994) and Lapthorn, A. J., Harris,
D. C., Littlejohn, A., Lustbader, J. W., Canfield, R. E., Machin, K. J., Morgan, F. J., Isaacs, N.
W.: Nature 369 pp. 455 (1994), both including the β chain of hCG. The β chain of hFSH is
32 percent identical to the amino acid sequence of the structural part of the β chain of hCG
(see the sequence alignment of Figure 1). A series of 50 models of the 3D structure of FSH
was built based on the above two available hCG structures and based on the sequence
alignment in Figure 1 using the program Modeller 98 (MSI Inc., 1999). The four N-terminal
residues (A1(a), P2(a), D3(a) and V4(a)) as well as the three C-terminal residues (H90(a),
K91(a) and S92(a)) were not modeled as they are not identified in the hCG structures. All of the
hFSH-β chain was modeled, even the part which has no homologous residues in the hCG
structures.

Accessible Surface Area (ASA) 

The computer program Access (B. Lee and F. M. Richards, J. Mol. Biol. 55:
379-400 (1971)) version 2 (©1983 Yale University) was used to compute the accessible
surface area (ASA) of the individual atoms in the structure. This method typically uses a
probe size of 1.4 Å and defines the accessible surface area (ASA) as the area formed by the center
of the probe. Prior to this calculation all water molecules and all hydrogen atoms should be
removed from the coordinate set, as should other atoms not directly related to the protein.

Fractional ASA of side chain 

The fractional ASA of the side chain atoms is computed by dividing the sum
of the ASA of the atoms in the side chain by a value representing the ASA of the side chain
atoms of that residue type in an extended Ala-x-Ala tripeptide, see Hubbard, Campbell &
Thornton (1991) J. Mol. Biol. 220, 507-530. For this example the CA atom is regarded as
being a part of the side chain of glycine residues but not other residues. The following values are
used as standard 100% ASA for the side chain:



Ala     69.23 Å²
Leu    140.76 Å²
Arg    200.35 Å²
Lys    162.50 Å²
Met
Ser
Glu
Thr    101.67 Å²
Gly     32.28 Å²
Trp    210.89 Å²
His    147.00 Å²
Tyr    176.61 Å²
Ile    137.91 Å²
Val    114.14 Å²
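The division described above may be illustrated by the following short Python sketch. It is a minimal illustration only and not the Access program itself: the function name, the per-atom ASA values and the choice of lysine are hypothetical, and the reference dictionary contains only those residue types for which a value is legible in the table above.

```python
# Reference side chain ASA values (square angstroms) for an extended
# Ala-x-Ala tripeptide, restricted to the entries legible in the table above.
REFERENCE_SIDE_CHAIN_ASA = {
    "Ala": 69.23, "Leu": 140.76, "Arg": 200.35, "Lys": 162.50,
    "Thr": 101.67, "Gly": 32.28, "Trp": 210.89, "His": 147.00,
    "Tyr": 176.61, "Ile": 137.91, "Val": 114.14,
}


def fractional_side_chain_asa(residue, atom_asa, side_chain_atoms):
    """Sum the ASA of the side chain atoms and divide by the reference value."""
    total = sum(atom_asa[name] for name in side_chain_atoms)
    return total / REFERENCE_SIDE_CHAIN_ASA[residue]


# Hypothetical per-atom ASA values for a single lysine residue (illustration only).
lys_atom_asa = {
    "N": 1.2, "CA": 0.0, "C": 0.4, "O": 3.1,
    "CB": 10.0, "CG": 12.5, "CD": 20.0, "CE": 28.75, "NZ": 10.0,
}
# Backbone atoms (N, CA, C, O) are excluded; CA counts as side chain only for Gly.
lys_fraction = fractional_side_chain_asa(
    "Lys", lys_atom_asa, ["CB", "CG", "CD", "CE", "NZ"]
)
print(f"Lys side chain exposure: {lys_fraction:.1%}")
```

For a glycine residue the list of side chain atoms would contain only CA, in line with the convention stated above.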



Determination of surface exposed residues from structural models: 

Surface accessibility and fractional ASA of side chains were calculated for 
each of the 50 model structures. The average value over the structural ensemble was used in 
the following. The N- and C-terminal residues of the FSH-α chain not included in the model
are defined as having 100% side chain accessibility.

The following amino acid residues in hFSH-α and hFSH-β, respectively, have
more than 25% of their side chain exposed to the surface:

A1(a), P2(a), D3(a), V4(a), Q5(a), D6(a), P8(a), E9(a), T11(a), L12(a), Q13(a),
E14(a), P16(a), F17(a), Q20(a), P21(a), G22(a), A23(a), P24(a), L26(a), M29(a), F33(a),
R42(a), S43(a), K44(a), K45(a), T46(a), L48(a), V49(a), Q50(a), N52(a), V61(a), K63(a),
S64(a), Y65(a), N66(a), R67(a), V68(a), T69(a), M71(a), G72(a), G73(a), F74(a), K75(a),
N78(a), T80(a), A81(a), H83(a), C84(a), S85(a), T86(a), Y88(a), Y89(a), H90(a), K91(a),
S92(a), N1(b), S2(b), E4(b), L5(b), T6(b), N7(b), I8(b), T9(b), K14(b), E15(b), E16(b),
R18(b), F19(b), I21(b), S22(b), N24(b), Y31(b), Y33(b), R35(b), D36(b), L37(b), Y39(b),
K40(b), D41(b), P42(b), A43(b), R44(b), P45(b), K46(b), I47(b), K49(b), K54(b), E55(b),
L56(b), V57(b), Y58(b), E59(b), T60(b), V61(b), R62(b), P64(b), G65(b), A67(b), H68(b),
H69(b), D71(b), L73(b), Y74(b), T75(b), T80(b), Q81(b), H83(b), G85(b), K86(b), D88(b),
S89(b), D90(b), S91(b), D93(b), T95(b), V96(b), R97(b), G98(b), L99(b), G100(b), Y103(b),
S105(b), F106(b), G107(b), E108(b), M109(b), K110(b), and E111(b).

The following amino acid residues have more than 50% of their side chain
exposed to the surface:

A1(a), P2(a), D3(a), V4(a), Q5(a), D6(a), P8(a), E9(a), T11(a), Q13(a), E14(a),
P16(a), F17(a), Q20(a), P21(a), G22(a), A23(a), K45(a), T46(a), L48(a), V49(a), Q50(a),
N52(a), K63(a), S64(a), N66(a), R67(a), T69(a), G72(a), G73(a), K75(a), T86(a), Y89(a),
H90(a), K91(a), S92(a), N1(b), N7(b), T9(b), E15(b), E16(b), R18(b), F19(b), N24(b),
Y33(b), D41(b), P42(b), A43(b), R44(b), P45(b), K46(b), I47(b), K54(b), E55(b), V57(b),
Y58(b), E59(b), R62(b), P64(b), G65(b), A67(b), H68(b), H69(b), D71(b), L73(b), T75(b),
Q81(b), H83(b), K86(b), D88(b), S89(b), D90(b), S91(b), T95(b), R97(b), G98(b), L99(b),
G100(b), Y103(b), S105(b), F106(b), G107(b), E108(b), M109(b), K110(b), and E111(b).
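The averaging over the structural ensemble and the 25% and 50% cut-offs described above may be sketched as follows. This is an illustrative Python sketch under stated assumptions: the function name is hypothetical, only three models are shown instead of 50, and all fractional ASA values are invented for the purpose of the illustration.

```python
from statistics import mean


def exposed_residues(per_model_fractions, threshold):
    """Return residues whose ensemble-averaged fractional ASA exceeds threshold."""
    return [
        residue
        for residue, values in per_model_fractions.items()
        if mean(values) > threshold
    ]


# Hypothetical fractional side chain ASA values from three models.
fractions = {
    "Q5(a)": [0.62, 0.58, 0.66],
    "L12(a)": [0.31, 0.40, 0.34],
    "C7(a)": [0.05, 0.02, 0.08],
}
# Terminal residues absent from the model are defined as 100% accessible.
fractions["A1(a)"] = [1.0]

over_25 = exposed_residues(fractions, 0.25)
over_50 = exposed_residues(fractions, 0.50)
print(over_25, over_50)
```

With these invented values, a residue such as L12(a) passes the 25% cut-off but not the 50% cut-off, which is the kind of difference seen between the two lists above.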

Determining distances between atoms

The distance between atoms is most easily determined using molecular graphics
software, e.g. Insight II v. 98.0, MSI Inc.
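Where graphics software is not at hand, the same distance may be computed directly from atomic coordinates. The following Python sketch assumes that coordinates are available as (x, y, z) values in Å, for example as read from a coordinate file; the function name and the coordinates are hypothetical.

```python
import math


def atom_distance(a, b):
    """Euclidean distance between two atoms given as (x, y, z) in angstroms."""
    return math.dist(a, b)


# Hypothetical coordinates of two atoms (illustration only).
d = atom_distance((1.0, 2.0, 3.0), (4.0, 6.0, 3.0))
print(f"{d:.2f} angstroms")
```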



Example 1 

Construction of plasmids for expression of FSH

A gene encoding the human FSH-α subunit was constructed by assembly of
synthetic oligonucleotides by PCR using methods similar to the ones described in Stemmer et
al. (1995) Gene 164, pp. 49-53. The native FSH-α signal sequence was maintained in order to
allow secretion of the gene product. The codon usage of the gene was optimised for high
expression in mammalian cells. Furthermore, in order to achieve high gene expression, an intron
(from pCI-Neo (Promega)) was included in the 5' untranslated region of the gene. The
synthetic gene was subcloned behind the CMV promoter in pcDNA3.1/Hygro (Invitrogen). The
sequence of the resulting plasmid, termed pBvdH977, is given in SEQ ID NO:5 (FSH-α-coding
sequence at position 1225 to 1570). Similarly, a synthetic gene encoding the wildtype
human FSH-β subunit was constructed. Also in this construct, the native signal sequence was
maintained (except for a Lys to Glu mutation at position 2) in order to allow secretion, and the
codon usage was optimised for high expression and an intron was included in the recipient
vector (pcDNA3.1/Zeo (Invitrogen)). The sequence of the resulting FSH-β-containing
plasmid, termed pBvdH1022, is given in SEQ ID NO:6 (FSH-β-coding sequence at position 1231
to 1617). A plasmid containing both the FSH-α and the FSH-β encoding synthetic genes was
generated by subcloning the FSH-α containing NruI-PvuII fragment from pBvdH977 into
pBvdH1022 linearized with NruI. The resulting plasmid, in which the FSH-α and FSH-β
expression cassettes are in direct orientation, was termed pBvdH1100.

Example 2 

Expression of FSH in CHO cells

FSH was expressed in Chinese Hamster Ovary (CHO) K1 cells, obtained from
the American Type Culture Collection (ATCC, CCL-61).

For transient expression of FSH, cells were grown to 95% confluency in
serum-containing media (MEMα with ribonucleotides and deoxyribonucleotides (Life
Technologies Cat # 32571-028) containing 1:10 FBS (BioWhittaker Cat # 02-701F) and 1:100
penicillin and streptomycin (BioWhittaker Cat # 17-602E), or Dulbecco's MEM/Nut.-mix
F-12 (Ham) L-glutamine, 15 mM Hepes, pyridoxine-HCl (Life Technologies Cat # 31330-038)
with the same additives). FSH-encoding plasmids were transfected into the cells using
Lipofectamine 2000 (Life Technologies) according to the manufacturer's specifications.
24-48 hrs after transfection, culture media were collected, centrifuged and filtered through
0.22 µm filters to remove cells.

Stable clones expressing FSH were generated by transfection of CHO K1 cells
with FSH-encoding plasmids followed by incubation of the cells in selective media (for
instance one of the above media containing 0.5 mg/ml zeocin for cells transfected with plasmid
pBvdH1100). Stably transfected cells were isolated and sub-cloned by limited dilution.
Clones that produced high levels of FSH were identified by ELISA (see below).

Example 3 

Large-scale production of FSH in CHO cells 

The cell line CHO K1 1100-5, stably expressing human FSH, was passed 1:10
from a confluent culture and propagated as adherent cells in serum-containing medium
Dulbecco's MEM/Nut.-mix F-12 (Ham) L-glutamine, 15 mM Hepes, pyridoxine-HCl (Life
Technologies Cat # 31330-038), 1:10 FBS (BioWhittaker Cat # 02-701F), 1:100 penicillin and
streptomycin (BioWhittaker Cat # 17-602E) until confluence in a 10 layer cell factory (NUNC
#165250). The media was then changed to serum-free media: Dulbecco's MEM/Nut.-mix
F-12 (Ham) L-glutamine, pyridoxine-HCl (Life Technologies Cat # 21041-025) with the
addition of 1:500 ITS-A (Gibco/BRL # 51300-044), 1:500 EX-CYTE VLE (Serological Proteins
Inc. # 81-129) and 1:100 penicillin and streptomycin (BioWhittaker Cat # 17-602E).
Subsequently, every 24 h, culture media were collected and replaced with 1 liter of fresh
serum-free media of the same composition. The collected media was filtered through 0.22 µm
filters to remove cells. Growth in cell factories was continued with daily harvests and
replacements of the culture media until FSH yields dropped below one-fourth of the initial
expression level (typically after 10-15 days).

Example 4 

Analysis of FSH forms by Western blotting and isoelectric focusing 

The FSH content of samples was analysed by Western blotting: Proteins were
separated by SDS-PAGE and a standard Western blot was performed using rabbit anti-human
FSH (AHP519, Serotec) or mouse anti-human FSH-β (MCA338, Serotec) as primary
antibody, and an ImmunoPure Ultra Sensitive ABC Peroxidase Staining Kit (Pierce) for
detection. Wild-type FSH produced as described above in Examples 1-3 was found to have the
same mobility as FSH from references such as Puregon® (Organon) or Gonal-F® (Serono).

For analysis of pI, samples were separated on pH 3-7 IEF gels (NOVEX). After
electrophoresis, proteins were blotted onto Immobilon-P (Millipore) membranes and a
Western blot was performed as described above, using the same antibodies and detection kit. In
accordance with published observations (see, for instance, Loumaye et al. (1998) Human
Reprod. Update 4, 862-881), various FSH isoforms were detected, mostly in the pH 4-5.2 range
for wildtype FSH. This is due to heterogeneity in carbohydrate content, most importantly
sialic acid.

Example 5 

Purification of FSH wildtype and variants

Three chromatographic steps were employed to obtain highly purified FSH:
first an anion exchange step, then hydrophobic interaction chromatography (HIC), and finally
an immunoaffinity step using an FSH-β specific monoclonal antibody.

Culture supernatants were prepared as described in Example 3. Filtered culture
supernatants were concentrated 10 to 20 times by ultrafiltration (10 kD cut-off membrane),
pH was adjusted to 8.0 and conductivity to 10-15 mS/cm, before application on a DEAE
Sepharose (Pharmacia) anion exchanger column, which had been equilibrated in ammonium
acetate buffer (0.16 M, pH 8.0). Semipurified FSH was recovered both in the unbound
flow-through fraction and in the wash fraction using 0.16 M ammonium acetate, pH 8.0. The
flow-through and wash fractions were pooled and ammonium sulfate was added from a stock
solution (4.5 M) to obtain a final concentration of 1.5 M (NH4)2SO4. The pH was adjusted to
7.0.
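The volume of 4.5 M stock solution needed to reach 1.5 M ammonium sulfate follows from a simple mass balance. The Python sketch below is illustrative only: it assumes that the pooled fractions contain no ammonium sulfate beforehand and that volumes are additive, and the function name and the pool volume of 200 ml are hypothetical.

```python
def stock_volume_to_add(pool_volume_ml, stock_molar, final_molar):
    """Volume of stock to add so the mixture reaches final_molar.

    Mass balance: stock_molar * v = final_molar * (pool_volume_ml + v).
    """
    if final_molar >= stock_molar:
        raise ValueError("final concentration must be below the stock concentration")
    return pool_volume_ml * final_molar / (stock_molar - final_molar)


# Hypothetical pool of 200 ml; 4.5 M stock and 1.5 M target as in the text.
added_ml = stock_volume_to_add(200.0, 4.5, 1.5)
print(f"add {added_ml:.0f} ml of 4.5 M stock")
```

Under these assumptions the stock volume is always half of the pool volume, since 1.5 / (4.5 - 1.5) = 0.5.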

The partially purified FSH was subsequently applied on a 25 ml butyl
Sepharose (Pharmacia) HIC column. After application, the column was washed with at least 3
column volumes of 1.5 M (NH4)2SO4, 20 mM ammonium acetate, pH 7 (until the absorbance at
280 nm reached baseline level) and FSH was eluted with 4 column volumes of buffer B (20
mM ammonium acetate, pH 7). FSH enriched fractions from the HIC step were pooled,
concentrated and diafiltrated using Vivaspin 20 modules, 10 kD cut-off membrane (Vivascience),
into 50 mM sodium phosphate, 150 mM NaCl, pH 7.2.

For the third chromatographic step, an anti-FSH-β monoclonal antibody
(RDI-FSH909, Research Diagnostics) was immobilized to CNBr-activated Sepharose (Pharmacia)
using a standard procedure from the supplier. Approximately 1 mg antibody was coupled per
ml resin. The immunoaffinity resin was packed in plastic columns and equilibrated with 50
mM sodium phosphate, 150 mM NaCl, pH 7.2 before application.

The buffer exchanged eluate from the butyl HIC step was applied on the
antibody column by use of gravity flow. This was followed by several washing steps in 50 mM
sodium phosphate solutions (0.5 M NaCl and 1 M NaCl, both pH 7.2). Elution was performed
using either 1 M NH3 or 0.6 M NH3, 40% (v/v) isopropanol and the eluate was immediately
neutralized with 1 M acetic acid to pH 6-8.

The purified FSH bulk product was concentrated and diafiltrated using
Vivaspin 20 modules, 10 kD cut-off membrane (Vivascience), into 50 mM sodium phosphate,
150 mM NaCl, pH 7.2. For subsequent storage, BSA was added to 0.1% (w/v) and the
purified FSH was microfiltrated using a 0.22 µm filter prior to storage at -80°C.

SDS-PAGE, run under non-dissociating conditions (without boiling), showed
wildtype FSH migrating as an apparent 42±3 kDa band, slightly diffuse due to heterogeneity
in the attached carbohydrates. The purity was about 80-90%. N-terminal sequencing showed
that the α-chain had the expected N-terminal sequence starting with residue 1 (SEQ ID NO:2)
and the β-chain starting with residue 3 (SEQ ID NO:4). These N-terminal sequences have
been found previously for recombinant FSH produced in CHO cells (Olijve, W. et al. (1996)
Mol. Hum. Reprod. 2, 371-382).

Example 6 

FSH in vitro activity assay

6.1 FSH assay outline

It has previously been published that activation of the FSH receptor by FSH
leads to an increase in the intracellular concentration of cAMP. Consequently, transcription is
activated at promoters containing multiple copies of the cAMP response element (CRE). It is
thus possible to measure FSH activity by use of a CRE luciferase reporter gene introduced
into CHO cells expressing the FSH receptor.

6.2 Construction of a CHO FSH-R / CRE-luc cell line 

Stable clones expressing the human FSH receptor were produced by
transfection of CHO K1 cells with a plasmid containing the receptor cDNA inserted into pcDNA3
(Invitrogen) followed by selection in media containing 600 µg/ml G418. Using a commercial
cAMP-SPA RIA (Amersham), clones were screened for the ability to respond to FSH
stimulation. On the basis of these results, an FSH receptor-expressing CHO clone was selected for
further transfection with a CRE-luc reporter gene. A plasmid containing the reporter gene
with 6 CRE elements in front of the Firefly luciferase gene was co-transfected with a plasmid
conferring Hygromycin B resistance. Stable clones were selected in the presence of 600 µg/ml
G418 and 400 µg/ml Hygromycin B. A clone yielding a robust luciferase signal upon
stimulation with FSH (EC50 ~ 0.01 IU/ml) was obtained. This CHO FSH-R / CRE-luc cell line was
used to measure the activity of samples containing FSH.

6.3 FSH luciferase assay

To perform activity assays, CHO FSH-R / CRE-luc cells were seeded in white
96 well culture plates at a density of about 15,000 cells/well. The cells were in 100 µl
DMEM/F-12 (without phenol red) with 1.25% FBS. After incubation overnight (at 37°C, 5%
CO2), 25 µl of sample or standard diluted in DMEM/F-12 (without phenol red) with 10%
FBS was added to each well. The plates were further incubated for 3 hrs, followed by addition
of 125 µl LucLite substrate (Packard Bioscience). Subsequently, plates were sealed and
luminescence was measured on a TopCount luminometer (Packard) in SPC (single photon
counting) mode.
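The well volumes given above determine the dilution of the sample and the serum concentration during stimulation. The following Python sketch merely works through that arithmetic; the function name is hypothetical and additive volumes are assumed.

```python
def mix(volumes_ul, concentrations):
    """Return total volume and volume-weighted concentration of a mixture."""
    total = sum(volumes_ul)
    concentration = sum(v * c for v, c in zip(volumes_ul, concentrations)) / total
    return total, concentration


# 100 µl of medium with 1.25% FBS plus 25 µl of sample diluted in 10% FBS.
well_volume_ul, fbs_percent = mix([100.0, 25.0], [1.25, 10.0])
sample_dilution = well_volume_ul / 25.0      # fold dilution of the added sample
final_volume_ul = well_volume_ul + 125.0     # after addition of LucLite substrate
print(well_volume_ul, fbs_percent, sample_dilution, final_volume_ul)
```

On these figures the sample is diluted five-fold in the well and the cells are stimulated in 3% FBS, so standards and samples are best compared at their in-well concentrations.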

Example 7

FSH ELISA

The concentration of FSH in samples was quantified by use of a commercial
immunoassay (DRG FSH EIA, DRG Instruments GmbH, Marburg, Germany). DRG FSH
EIA is a solid phase immunosorbent assay (ELISA) based on the sandwich principle. The
microtiter wells are coated with a monoclonal antibody directed towards a unique antigenic
site on the FSH-β subunit. An aliquot of FSH-containing sample (diluted in H2O with 0.1%
BSA) and an anti-FSH antiserum conjugated with horseradish peroxidase are added to the
coated wells. After incubation, unbound conjugate is washed off with water. The amount of
bound peroxidase is proportional to the concentration of FSH in the sample. The intensity of
colour developed upon addition of substrate solution is proportional to the concentration of
FSH in the sample.
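Because the colour intensity rises with the FSH concentration, the concentration of an unknown sample may be read from a standard curve. The Python sketch below uses piecewise linear interpolation between standards as one simple possibility; it is not taken from the kit instructions, and the function name, the standard values, their units and the 10-fold pre-dilution are all hypothetical.

```python
def interpolate_concentration(absorbance, standards):
    """Read a concentration from (concentration, absorbance) standards.

    The standards must be sorted by concentration with rising absorbance.
    """
    for (c_lo, a_lo), (c_hi, a_hi) in zip(standards, standards[1:]):
        if a_lo <= absorbance <= a_hi:
            return c_lo + (absorbance - a_lo) * (c_hi - c_lo) / (a_hi - a_lo)
    raise ValueError("absorbance outside the range of the standard curve")


# Hypothetical standard curve: (concentration, absorbance) pairs.
standards = [(0.0, 0.05), (5.0, 0.25), (10.0, 0.45), (20.0, 0.85)]

in_well = interpolate_concentration(0.35, standards)
undiluted = in_well * 10   # hypothetical 10-fold pre-dilution of the sample
print(in_well, undiluted)
```

Readings outside the range of the standards raise an error rather than being extrapolated; such samples would be re-assayed at another dilution.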

Example 8 

Animal studies 

The pharmacokinetic profile of FSH and variant forms was determined as
follows: Immature 26-27 day old female Sprague-Dawley rats were injected i.v. with 3-4 µg
FSH, produced, purified and analyzed as described in Examples 1-7. Subsequently, blood
samples were taken at various time-points after injection. FSH concentrations in serum
samples were determined by ELISA, as described in Example 7.
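One common way of summarizing such concentration-time data is an apparent half-life obtained from a log-linear fit. The Python sketch below is an illustration under the assumption of mono-exponential elimination, which the present description does not itself state; the function name is hypothetical and the data are synthetic.

```python
import math


def apparent_half_life(times_h, concentrations):
    """Least-squares fit of ln(concentration) against time; returns ln(2)/k."""
    logs = [math.log(c) for c in concentrations]
    n = len(times_h)
    t_mean = sum(times_h) / n
    l_mean = sum(logs) / n
    slope = sum(
        (t - t_mean) * (l - l_mean) for t, l in zip(times_h, logs)
    ) / sum((t - t_mean) ** 2 for t in times_h)
    return -math.log(2) / slope


# Synthetic serum concentrations generated with a half-life of 4 hours.
times = [1.0, 2.0, 4.0, 8.0]
serum = [100.0 * 0.5 ** (t / 4.0) for t in times]
half_life_h = apparent_half_life(times, serum)
print(f"apparent half-life: {half_life_h:.2f} h")
```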

In vivo bioactivity of wildtype recombinant FSH and variant forms may be evaluated by the
ovarian weight augmentation assay (Steelman and Pohley (1953) Endocrinology 53, 604-616).
Furthermore, the ability of FSH and variant forms to stimulate maturation of follicles in
laboratory animals may be detected with e.g. ultrasound equipment.
Example 9

Construction and analysis of a variant form of FSH containing two N-linked glycosylations at
the N-terminus of the α subunit

A construct encoding a modified form of FSH-α, having two additional sites
for N-linked glycosylation at its N-terminus, was generated by site-directed mutagenesis using
standard DNA techniques known in the art. A DNA fragment encoding the sequence
Ala-Asn-Ile-Thr-Val-Asn-Ile-Thr-Val was inserted immediately upstream of the mature FSH-α
sequence in pBvdH977. The sequence of the resulting plasmid, termed pBvdH1163, is given
in SEQ ID NO:7 (modified FSH-α-encoding sequence at position 1225 to 1599). A plasmid
encoding both subunits was constructed by subcloning the FSH-containing NruI-PvuII
fragment from pBvdH1163 into pBvdH1022 (Example 1), which had been linearized with PvuII.
The resulting plasmid was termed pBvdH1208.
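That the inserted sequence provides two acceptor sites can be checked against the consensus for N-linked glycosylation, Asn-X-Ser/Thr where X is any residue except Pro. The Python sketch below is illustrative only; the function name is hypothetical, and a match indicates a potential site, not that the site is actually glycosylated.

```python
import re

# N-glycosylation sequon: Asn-X-Ser/Thr, where X is any residue except Pro.
# The lookahead allows overlapping sequons to be reported.
SEQUON = re.compile(r"(?=N[^P][ST])")


def sequon_positions(sequence):
    """Return 1-based positions of Asn residues that start an N-X-S/T sequon."""
    return [match.start() + 1 for match in SEQUON.finditer(sequence)]


# Ala-Asn-Ile-Thr-Val-Asn-Ile-Thr-Val in one-letter code.
extension = "ANITVNITV"
sites = sequon_positions(extension)
print(sites)
```

For this extension the asparagines at positions 2 and 6 are reported, corresponding to the two additional sites described above.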

For expression of the variant form of FSH containing two N-linked
glycosylations at the N-terminus of the α subunit (termed FSH1208), CHO K1 cells were transfected
with pBvdH1208 or co-transfected with a combination of pBvdH1163, encoding the modified
α subunit, and pBvdH1022, encoding the wildtype β subunit. Transient expressions, isolation
of stable expression clones, and large-scale production of FSH1208 were performed as
described for wildtype FSH in Examples 2 and 3.

Western blotting and isoelectric focusing were performed as described in
Example 4. Western blotting showed that FSH1208 had a larger molecular mass than wildtype
FSH, indicating that the introduction of acceptor sites for N-linked glycosylation at the
N-terminus of the α subunit indeed led to hyperglycosylation of FSH. Isoelectric focusing
demonstrated that the FSH forms in the FSH1208 samples were found in a lower pI range
than wildtype FSH. Thus, the pH interval for FSH1208 isoforms was about 3.0-4.5 versus
about 4.0-5.2 for wildtype FSH. This indicated that FSH1208 molecules are on average more
negatively charged than the wild type, which is attributed to the presence of additional sialic
acid residues.

FSH1208 was purified and characterized as described in Example 5.
SDS-PAGE, run under non-dissociating conditions (without boiling), showed FSH1208 migrating
as an apparent 55±5 kDa band, slightly diffuse due to heterogeneity in the attached
carbohydrates. The purity was about 80-90%. N-terminal sequencing showed that while the β-chain
had the same N-terminal sequence as wildtype FSH, the sequence of the α-chain was in
agreement with this subunit carrying the expected N-terminal extension ANITVNITV, in which
both asparagine residues are glycosylated.

The specific activity of FSH1208 was determined by measurement of the in
vitro bioactivity (FSH luciferase assay, Example 6) and the FSH content of the samples (FSH
ELISA, Example 7). The specific activity of FSH1208 was found to be about one-third of that
of the wildtype reference.
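The specific activity referred to above is the ratio of measured bioactivity to measured FSH content. The following Python sketch shows the calculation with hypothetical figures and hypothetical units, chosen only so that the ratio comes out at one-third; they are not measured data.

```python
def specific_activity(bioactivity_per_ml, content_per_ml):
    """Bioactivity (luciferase assay) divided by FSH content (ELISA)."""
    return bioactivity_per_ml / content_per_ml


# Hypothetical measurements (arbitrary but consistent units per ml).
wildtype = specific_activity(12.0, 1.0)
variant = specific_activity(8.0, 2.0)
relative = variant / wildtype
print(f"variant specific activity relative to wildtype: {relative:.2f}")
```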

A pharmacokinetic study performed as described in Example 8 showed that 24
hours after injection of equal amounts of wildtype FSH and FSH1208, the sera of
FSH1208-treated animals contained more than 10-fold more remaining immunoreactive material than
the sera from animals treated with wildtype FSH.

Example 10 

Construction and analysis of other FSH variants containing additional glycosylation sites 

Plasmids encoding variant forms of FSH-α and FSH-β containing additional
sites for N-linked glycosylation were generated by site-directed mutagenesis using standard
DNA techniques known in the art. The following amino acid substitutions and/or insertions
were generated:

FSH1147: Amino acid Tyr58 of mature FSH-β altered to Asn
FSH1349: N-terminus of mature FSH-α altered from APD QDC ... to:
APNDTVNFT QDC ...
FSH1354: N-terminus of mature FSH-β altered from NS CEL ... to:
NS NTrVNITV CEL ...

Plasmids encoding the variant forms were transiently expressed in CHO K1
cells as described in Example 2. Plasmids encoding FSH-α variants were co-transfected with
a plasmid encoding wild-type FSH-β and vice versa.

Western blotting and isoelectric focusing were performed on culture media samples as
described in Example 4. The variant forms had higher molecular weights than the wild-type,
indicating that the additional acceptor sites for N-linked glycosylation had indeed been
glycosylated. Furthermore, isoelectric focusing showed that the different isoforms of the three FSH
variants were spread over a lower pI range than the wildtype. This strongly suggests that the
variant forms had a higher sialic acid content than the wildtype.

In vitro FSH activities of the resulting media samples were analysed as
described in Example 6.3. All three variant forms were able to stimulate the CHO FSH-R /
CRE-luc cells, indicating that these variant FSH forms have retained significant FSH activity.

While the foregoing invention has been described in some detail for purposes
of clarity and understanding, it will be clear to one skilled in the art from a reading of this
disclosure that various changes in form and detail can be made without departing from the
true scope of the invention. For example, all the techniques, methods, compositions, apparatus
and systems described above may be used in various combinations. All publications, patents,
patent applications, or other documents cited in this application are incorporated by reference
in their entirety for all purposes to the same extent as if each individual publication, patent,
patent application, or other document were individually indicated to be incorporated by
reference for all purposes.
CLAIMS

1. A heterodimeric polypeptide conjugate exhibiting FSH activity, comprising

i) a dimeric polypeptide comprising an FSH-α subunit and an FSH-β subunit,
wherein at least one of said FSH-α and FSH-β subunits differs from the corresponding
wildtype subunit in that at least one amino acid residue comprising an attachment
group for a non-polypeptide moiety has been introduced or removed, and

ii) at least one non-polypeptide moiety bound to an attachment group of at least
one of said subunits.

2. The conjugate of claim 1, wherein the amino acid sequence of at least one of
said FSH-α and FSH-β subunits differs from that of the corresponding wildtype subunit in
that an amino acid residue comprising an attachment group for the non-polypeptide moiety
has been introduced.

3. The conjugate of claim 2, wherein the introduced attachment group is selected
from the group consisting of an N-glycosylation site, an O-glycosylation site, and an
attachment group for a polymer molecule, a lipophilic compound, a carbohydrate moiety or an
organic derivatizing agent.

4. The conjugate of any of claims 1-3, comprising at least one PEG molecule
attached to an attachment group of at least one of the subunits.

5. The conjugate of any of claims 1-4, comprising at least one introduced
N-glycosylation site, and further comprising at least one PEG molecule attached to an
attachment group of at least one of the subunits.

6. The conjugate of claim 5, wherein said at least one PEG molecule is bound to
the N-terminal of at least one of the subunits.

7. The conjugate of any of claims 1-6, wherein the amino acid sequence of the
FSH-α subunit differs from that of wildtype human FSH-α.

8. The conjugate of any of claims 1-6, wherein the amino acid sequence of the
FSH-β subunit differs from that of wildtype human FSH-β.

9. A heterodimeric polypeptide conjugate exhibiting FSH activity, comprising

i) a dimeric polypeptide comprising an FSH-α subunit and an FSH-β subunit,
wherein the amino acid sequence of at least one of said FSH-α and FSH-β subunits differs
from that of the corresponding wildtype subunit in that at least one N-glycosylation site has
been introduced, and

ii) at least one oligosaccharide moiety bound to an N-glycosylation site of at
least one of said subunits.



10. The conjugate of claim 9, wherein at least one N-glycosylation site has been 

introduced into the FSH-a subunit by a mutation selected from the group consisting of 
P2(a)N+V4(a)S, P2(a)N+V4(a)T, D3(a)N+Q5(a)S, D3(a)N+Q5(a)T, V4(a)N+D6(a)S, 
V4(a)N+D6(a)S, D6(a)N+P8(a)S, D6(a)N+P8(a)T, E9(a)N+Tll(a)S, E9(a)N, 
Tll(a)N+Q13(a)S, Tll(a)N+Q13(a)T, L12(a)N+E14(a)S, L12(a)N+E14(a)T, 
E14(a)N+P16(a)S, E14(a)N+P16(a)T, P16(a)N+F18(a)S, P16(a)N+F18(a)T, F17(a)N, 
F17(a)N+S19(a)T, G22(a)N+P24(a)S, G22(a)N+P24(a)T, P24(a)N+L26(a)S, 
P24(a)N+L26(a)T, F33(a)N+R35(a)S, F33(a)N+R35(a)T, R42(a)N+K44(a)S, 
R42(a)N+K44(a)T, S43(a)N+K45(a)S, S43(a)N+K45(a)T, K44(a)N+T46(a)S, K44(a)N, 
K45(a)N+M47(a)S, K45(a)N+M47(a)T, T46(a)N+L48(a)S, T46(a)N+L48(a)T, 
L48(a)N+Q50(a)S, 148(a)N+Q50(a)T, V49(a)N+K51(a)S, V49(a)N+K51(a)T, 
Q50(a)N+N52(a)S, Q50(a)N+N52(a)T, V61(a)N+K63(a)S, V61(a)N+K63(a)T, 
K63(a)N+Y65(a)S, K63(a)N+Y65(a)T, S64(a)N+N66(a)S, S64(a)N+N66(a)T, 
Y65(a)N+R67(a)S, Y65(a)N+R67(a)T, V68(a)S, V68(a)T, R67(a)N+T69(a)S, R67(a)N, 
T69(a)N+M71(a)S, T69(a)N+M71(a)T, M71(a)N+G73(a)S, M71(a)N+G73(a)T, 
G72(a)N+F74(a)S, G72(a)N+F74(a)T, G73(a)N+K75(a)S, G73(a)N+K75(a)T, 
F74(a)N+V76(a)S, F74(a)N+V76(a)T, K75(a)N+E77(a)S, K75(a)N+E77(a)T, 
A81(a)N+H83(a)S, A81(a)N+H83(a)T, H83(a)N, T86(a)N+Y88(a)S, T86(a)N+Y88(a)T, 
Y88(a)N+H90(a)S, Y88(a)N+H90(a)T, Y89(a)N+K91(a)S, Y89(a)N+K91(a)T, H90(a)N and 
H90(a)N+S92(a)T. 
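
Each substitution listed above can be checked against the wildtype hFSH-α sequence of SEQ ID NO:2 to see where it creates an N-X-S/T sequon. The following is a minimal sketch only: the function names are hypothetical, the one-letter sequence is transcribed from SEQ ID NO:2, and the rule that X must not be proline is the usual consensus convention assumed here, not language taken from the claims.

```python
import re

# Mature hFSH-alpha (SEQ ID NO:2, 92 residues), one-letter code.
HFSH_ALPHA = (
    "APDVQDCPECTLQENPFFSQPGAPILQCMGCC"
    "FSRAYPTPLRSKKTMLVQKNVTSESTCCVAKS"
    "YNRVTVMGGFKVENHTACHCSTCYYHKS"
)

# One substitution in the claim notation, e.g. "P2(a)N".
MUTATION = re.compile(r"([A-Z])(\d+)\(([ab])\)([A-Z])")


def apply_mutations(sequence, spec, chain):
    """Apply a '+'-joined mutation spec such as 'P2(a)N+V4(a)S'."""
    residues = list(sequence)
    for part in spec.split("+"):
        match = MUTATION.fullmatch(part.strip())
        if match is None:
            raise ValueError(f"unrecognised mutation: {part!r}")
        old, pos, tag, new = match.groups()
        index = int(pos) - 1  # claims use 1-based residue numbers
        if tag != chain:
            raise ValueError(f"{part} refers to chain {tag}, not {chain}")
        if residues[index] != old:
            raise ValueError(f"{part}: wildtype residue is {residues[index]}")
        residues[index] = new
    return "".join(residues)


def find_sequons(sequence):
    """Return 1-based positions of Asn in N-X-S/T motifs (X assumed != Pro)."""
    return [
        i + 1
        for i in range(len(sequence) - 2)
        if sequence[i] == "N" and sequence[i + 1] != "P" and sequence[i + 2] in "ST"
    ]


if __name__ == "__main__":
    print(find_sequons(HFSH_ALPHA))
    print(find_sequons(apply_mutations(HFSH_ALPHA, "P2(a)N+V4(a)S", "a")))
```

The wildtype check in apply_mutations also catches transcription slips in a mutation name, since a mismatch between the named residue and the sequence raises an error.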
11. The conjugate of claim 9 or 10, wherein at least one N-glycosylation site has
been introduced into the FSH-β subunit by a mutation selected from the group consisting of
S2(b)N+E4(b)S, S2(b)N+E4(b)T, E4(b)N+T6(b)S, E4(b)N, L5(b)N+N7(b)S, L5(b)N+N7(b)T,
T6(b)N+I8(b)S, T6(b)N+I8(b)T, I8(b)N+I10(b)S, I8(b)N+I10(b)T, T9(b)N+A11(b)S,
T9(b)N+A11(b)T, K14(b)N+E16(b)S, K14(b)N+E16(b)T, F19(b)N+I21(b)S,
F19(b)N+I21(b)T, I21(b)N+I23(b)S, I21(b)N+I23(b)T, S22(b)N+N24(b)S, 
S22(b)N+N24(b)T, Y31(b)N+Y33(b)S, Y31(b)N+Y33(b)T, Y33(b)N+R35(b)S, 
Y33(b)N+R35(b)T, R35(b)N+L37(b)S, R35(b)N+L37(b)T, D36(b)N+V38(b)S, 
D36(b)N+V38(b)T, L37(b)N+Y39(b)S, L37(b)N+Y39(b)T, K40(b)N+P42(b)S, 
K40(b)N+P42(b)T, A43(b)N+P45(b)S, A43(b)N+P45(b)T, P45(b)N+I47(b)S, 
P45(b)N+I47(b)T, K46(b)N+Q48(b)S, K46(b)N+Q48(b)T, I47(b)N+K49(b)S, 
I47(b)N+K49(b)T, K54(b)N+L56(b)S, K54(b)N+L56(b)T, E55(b)N+V57(b)S, 
E55(b)N+V57(b)T, L56(b)N+Y58(b)S, L56(b)N+Y58(b)T, V57(b)N+E59(b)S, 
V57(b)N+E59(b)T, Y58(b)N+T60(b)S, Y58(b)N, E59(b)N+V61(b)S, E59(b)N+V61(b)T, 
T60(b)N+R62(b)S, T60(b)N+R62(b)T, R62(b)N+P64(b)S, R62(b)N+P64(b)T, 
G65(b)N+A67(b)S, G65(b)N+A67(b)T, A67(b)N+H69(b)S, A67(b)N+H69(b)T, 
H68(b)N+A70(b)S, H68(b)N+A70(b)T, H69(b)N+D71(b)S, H69(b)N+D71(b)T, 
D71(b)N+L73(b)S, D71(b)N+L73(b)T, L73(b)N+T75(b)S, L73(b)N, T75(b)N+P77(b)S, 
T75(b)N+P77(b)T, H83(b)N+G85(b)S, H83(b)N+G85(b)T, K86(b)N+D88(b)S, 
K86(b)N+D88(b)T, D88(b)N+D90(b)S, D88(b)N+D90(b)T, S89(b)N, S89(b)N+S91(b)T, 
D90(b)N+T92(b)S, D90(b)N, S91(b)N+D93(b)S, S91(b)N+D93(b)T, D93(b)N+T95(b)S,
D93(b)N, T95(b)N+R97(b)S, T95(b)N+R97(b)T, V96(b)N+G98(b)S, V96(b)N+G98(b)T, 
R97(b)N+L99(b)S, R97(b)N+L99(b)T, L99(b)N+P101(b)S, L99(b)N+P101(b)T, Y103(b)N, 
Y103(b)N+S105(b)T, S105(b)N+G107(b)S, S105(b)N+G107(b)T, F106(b)N+E108(b)S, 
F106(b)N+E108(b)T, G107(b)N+M109(b)S, G107(b)N+M109(b)T, E108(b)N+K110(b)S, 
E108(b)N+K110(b)T, M109(b)N+E111(b)S, and M109(b)N+E111(b)T.

12. The conjugate of any of claims 9-11, wherein at least one of the FSH-α and
FSH-β subunits comprises at least one N- or C-terminal peptide addition comprising at least
one N-glycosylation site. 
13. The conjugate of any of claims 9-12, which further comprises at least one
non-polypeptide moiety different from an N- or O-linked oligosaccharide moiety bound to an
attachment group of the polypeptide.

14. The conjugate of any of claims 9-13, wherein the amino acid sequence of at
least one of said FSH-α and FSH-β subunits further differs from that of the corresponding
wildtype subunit in that at least one naturally occurring N-glycosylation site has been
removed.

15. A heterodimeric polypeptide conjugate exhibiting FSH activity, comprising a
dimeric polypeptide comprising an FSH-α subunit and an FSH-β subunit, wherein at least one
of said FSH-α and FSH-β subunits comprises a polymer molecule bound to the N-terminal
thereof.

16. The conjugate of claim 15, wherein the polymer molecule is polyethylene glycol.

17. The conjugate of claim 15 or 16, wherein at least one of said FSH-α and FSH-β
subunits comprises, relative to the corresponding wildtype human subunit, at least one
introduced amino acid residue comprising an attachment group for the polymer molecule, and/or
wherein at least one amino acid residue comprising an attachment group for a polymer
molecule has been removed.

18. A heterodimeric polypeptide conjugate exhibiting FSH activity, comprising a
dimeric polypeptide comprising FSH-α and FSH-β subunits, wherein at least one of said
FSH-α and FSH-β subunits comprises, relative to the corresponding wildtype subunit, at least
one introduced N- or O-glycosylation site at the N-terminal thereof, said at least one
introduced glycosylation site being glycosylated.

19. The conjugate of claim 18, wherein said at least one introduced N- or
O-glycosylation site is part of an N-terminal peptide addition.

20. The conjugate of any of the preceding claims, wherein the FSH-α subunit
comprises hFSH-α having the sequence shown in SEQ ID NO:2, or the FSH-β subunit comprises
hFSH-β having the sequence shown in SEQ ID NO:4.

21. The conjugate of any of the preceding claims, wherein the amino acid sequence
of the FSH-α and/or FSH-β subunit differs in 1-20 amino acid residues from that of the
corresponding wildtype sequence.

22. The conjugate of any of the preceding claims, which has an increased functional
in vivo half-life and/or serum half-life as compared to hFSH.

23. The conjugate of any of the preceding claims, wherein the FSH-α subunit and
the FSH-β subunit are linked by a peptide bond or a peptide linker to form a single-chain
polypeptide; or a single-chain polypeptide conjugate comprising at least two FSH-α subunits
or at least two FSH-β subunits, wherein at least one of said subunits differs from the
corresponding wildtype FSH subunit as defined in any of the preceding claims.

24. A composition comprising a conjugate according to any of claims 1-23 and at 
least one pharmaceutically acceptable carrier or excipient. 

25. Use of a conjugate according to any of claims 1-23 or a composition according to
claim 24 as a pharmaceutical.

26. Use of a conjugate according to any of claims 1-23 or a composition according
to claim 24 for the manufacture of a medicament for treatment of infertility.

27. A method of treating an infertile mammal, comprising administering to a
mammal in need thereof an effective amount of a conjugate according to any of claims 1-23 or a
composition according to claim 24.

28. A modified FSH-α polypeptide subunit having an amino acid sequence that
differs from that of the wildtype hFSH-α subunit in that at least one amino acid residue
comprising an attachment group for a non-polypeptide moiety has been introduced.

29. A modified FSH-β polypeptide subunit having an amino acid sequence that
differs from that of the wildtype hFSH-β subunit in that at least one amino acid residue
comprising an attachment group for a non-polypeptide moiety has been introduced.

30. A nucleotide sequence encoding a modified FSH-α polypeptide according to
claim 28 and/or a modified FSH-β polypeptide according to claim 29.

31. An expression vector comprising a nucleotide sequence according to claim 30. 

32. The expression vector of claim 31, comprising a nucleotide sequence encoding
(a) a modified FSH-α subunit and a wildtype hFSH-β subunit, (b) a wildtype hFSH-α subunit
and a modified FSH-β subunit, or (c) a modified FSH-α subunit and a modified FSH-β
subunit.

33. A pair of expression vectors, each vector being capable of transfecting a
eukaryotic cell, the vectors comprising nucleotide sequences encoding, respectively, a modified
FSH-α subunit according to claim 28 and a wildtype FSH-β subunit, a modified FSH-β
subunit according to claim 29 and a wildtype FSH-α subunit, or a modified FSH-α subunit
according to claim 28 and a modified FSH-β subunit according to claim 29.



34. A host cell comprising a nucleotide sequence according to claim 30, an
expression vector according to claim 31 or 32, or a pair of expression vectors according to claim 33.

35. The host cell of claim 34, which is a eukaryotic cell.

36. The host cell of claim 35, which is a mammalian cell. 

37. A method for producing a recombinant heterodimeric FSH protein, comprising
subjecting a host cell according to any of claims 34-36 comprising a nucleotide sequence
encoding an FSH-α subunit and an FSH-β subunit to cultivation under conditions conducive for
expression of said subunits.

38. The method of claim 37, wherein the host cell is a eukaryotic cell capable of in
vivo glycosylation, and the amino acid sequence of at least one of said FSH-α and FSH-β
subunits differs from the sequence of the corresponding wildtype subunit in that at least one
N-glycosylation site has been introduced.

39. The method of claim 38, further comprising subjecting the heterodimeric protein
to in vitro conjugation to a non-polypeptide moiety.

40. The method of claim 39, wherein the non-polypeptide moiety is a polymer
moiety such as PEG.

FIGURE 1

Sequence alignment of human FSH to the structural part of two published
structures of Human Chorionic Gonadotropin ("1HRP" and "1HCN"). The "/"
indicates the chain break between the alpha and the beta chain.

FSH   -QDCPECTLQ ENPFFSQPGA PILQCMGCCF SRAYPTPLRS KKTMLVQKNV
1HRP  TQDCPECTLQ ENPFFSQPGA PILQCMGCCF SRAYPTPLRS KKTMLVQKNV
1HCN  -QDCPECTLQ ENPFFSQPGA PILQCMGCCF SRAYPTPLRS KKTMLVQKNV

FSH   TSESTCCVAK SYNRVTVMGG FKVENHTACH CSTCYY/ -NSC EL TNI
1HRP  TSESTCCVAK SYNRVTVMGG FKVENHTACH CSTCYY/KEP LRPRCRPINA
1HCN  TSESTCCVAK SYNRVTVMGG FKVENHTACH CSTCYY/KEP LRPRCRPINA

FSH   TIAIEKEECR FCISINTTWC AGYCYTRDLV YKDPARPKIQ KTCTFKELVY
1HRP  TLAVEKEGCP VCITVNTTIC AGYCPTMTRV LQGVLPALPQ VVCNYRDVRF
1HCN  TLAVEKEGCP VCITVNTTIC AGYCPTMTRV LQGVLPALPQ VVCNYRDVRF

FSH   ETVRVPGCAH HADSLYTYPV ATQCHCGKCD SDSTDCTVRG LGPSYCSFGE
1HRP  ESIRLPGCPR GVNPVVSYAV ALSCQCALCR RSTTDCGGPK DHPLTCD...
1HCN  ESIRLPGCPR GVNPVVSYAV ALSCQCALCR RSTTDCGGPK DHPLTCD...

FSH   MKE
1HRP
1HCN
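
How closely two aligned rows of Figure 1 agree can be expressed as a percent identity over the aligned columns. The sketch below is illustrative only: the function name is hypothetical, columns containing a gap character are simply skipped, and the two 20-residue segments are copied from the FSH and 1HRP beta-chain rows of the figure.

```python
def percent_identity(row_a, row_b, gap="-"):
    """Percent of gap-free aligned columns in which both rows carry the same residue."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have equal length")
    columns = [(a, b) for a, b in zip(row_a, row_b) if a != gap and b != gap]
    if not columns:
        raise ValueError("no gap-free columns to compare")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


# Beta-chain segments taken from Figure 1 (two 10-residue blocks each).
FSH_BETA_SEGMENT = "TIAIEKEECR" + "FCISINTTWC"
HCG_BETA_SEGMENT = "TLAVEKEGCP" + "VCITVNTTIC"

if __name__ == "__main__":
    print(percent_identity(FSH_BETA_SEGMENT, HCG_BETA_SEGMENT))
```

The alpha-chain rows are identical between FSH and the two hCG structures wherever both are present, since the α subunit is common to both hormones; the beta-chain rows are where the alignment carries information.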

SEQUENCE LISTING

<110> Maxygen ApS 

<120> Follicle stimulating hormones 
<130> 214WO100 
<160> 7 

<170> PatentIn version 3.0

<210> 1 

<211> 116 

<212> PRT 

<213> Homo sapiens 

<400> 1 

Met Asp Tyr Tyr Arg Lys Tyr Ala Ala Ile Phe Leu Val Thr Leu Ser
1               5                   10                  15

Val Phe Leu His Val Leu His Ser Ala Pro Asp Val Gln Asp Cys Pro
            20                  25                  30

Glu Cys Thr Leu Gln Glu Asn Pro Phe Phe Ser Gln Pro Gly Ala Pro
        35                  40                  45

Ile Leu Gln Cys Met Gly Cys Cys Phe Ser Arg Ala Tyr Pro Thr Pro
    50                  55                  60

Leu Arg Ser Lys Lys Thr Met Leu Val Gln Lys Asn Val Thr Ser Glu
65                  70                  75                  80

Ser Thr Cys Cys Val Ala Lys Ser Tyr Asn Arg Val Thr Val Met Gly
                85                  90                  95

Gly Phe Lys Val Glu Asn His Thr Ala Cys His Cys Ser Thr Cys Tyr
            100                 105                 110

Tyr His Lys Ser
        115

<210> 2 

<211> 92 

<212> PRT 

<213> Homo sapiens 

<400> 2

Ala Pro Asp Val Gln Asp Cys Pro Glu Cys Thr Leu Gln Glu Asn Pro
1               5                   10                  15

Phe Phe Ser Gln Pro Gly Ala Pro Ile Leu Gln Cys Met Gly Cys Cys
            20                  25                  30

Phe Ser Arg Ala Tyr Pro Thr Pro Leu Arg Ser Lys Lys Thr Met Leu
        35                  40                  45

Val Gln Lys Asn Val Thr Ser Glu Ser Thr Cys Cys Val Ala Lys Ser
    50                  55                  60

Tyr Asn Arg Val Thr Val Met Gly Gly Phe Lys Val Glu Asn His Thr
65                  70                  75                  80

Ala Cys His Cys Ser Thr Cys Tyr Tyr His Lys Ser
                85                  90
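
The length declared in field <211> can be checked against the residues actually listed by converting the three-letter entries to one-letter code. This is a minimal sketch under stated assumptions: the helper name is hypothetical, the lookup table covers only the twenty standard amino acids, and the listing text is the SEQ ID NO:2 entry above with the position rows left out.

```python
# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# SEQ ID NO:2 (mature hFSH-alpha) as listed, without the numbering rows.
SEQ_ID_NO_2 = """
Ala Pro Asp Val Gln Asp Cys Pro Glu Cys Thr Leu Gln Glu Asn Pro
Phe Phe Ser Gln Pro Gly Ala Pro Ile Leu Gln Cys Met Gly Cys Cys
Phe Ser Arg Ala Tyr Pro Thr Pro Leu Arg Ser Lys Lys Thr Met Leu
Val Gln Lys Asn Val Thr Ser Glu Ser Thr Cys Cys Val Ala Lys Ser
Tyr Asn Arg Val Thr Val Met Gly Gly Phe Lys Val Glu Asn His Thr
Ala Cys His Cys Ser Thr Cys Tyr Tyr His Lys Ser
"""


def to_one_letter(listing):
    """Convert whitespace-separated three-letter residues to a one-letter string."""
    return "".join(THREE_TO_ONE[token] for token in listing.split())


if __name__ == "__main__":
    sequence = to_one_letter(SEQ_ID_NO_2)
    print(len(sequence), sequence)
```

A lookup failure (KeyError) on a token such as "lie" or "Gin" is a quick way to spot transcription errors in a listing before it is used further.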

<210> 3 

<211> 129 

<212> PRT 

<213> Homo sapiens 



<400> 3 

Met Lys Thr Leu Gln Phe Phe Phe Leu Phe Cys Cys Trp Lys Ala Ile
1               5                   10                  15

Cys Cys Asn Ser Cys Glu Leu Thr Asn Ile Thr Ile Ala Ile Glu Lys
            20                  25                  30

Glu Glu Cys Arg Phe Cys Ile Ser Ile Asn Thr Thr Trp Cys Ala Gly
        35                  40                  45

Tyr Cys Tyr Thr Arg Asp Leu Val Tyr Lys Asp Pro Ala Arg Pro Lys
    50                  55                  60

Ile Gln Lys Thr Cys Thr Phe Lys Glu Leu Val Tyr Glu Thr Val Arg
65                  70                  75                  80

Val Pro Gly Cys Ala His His Ala Asp Ser Leu Tyr Thr Tyr Pro Val
                85                  90                  95

Ala Thr Gln Cys His Cys Gly Lys Cys Asp Ser Asp Ser Thr Asp Cys
            100                 105                 110

Thr Val Arg Gly Leu Gly Pro Ser Tyr Cys Ser Phe Gly Glu Met Lys
        115                 120                 125

Glu

<210> 4 

<211> 111 

<212> PRT 

<213> Homo sapiens 

<400> 4

Asn Ser Cys Glu Leu Thr Asn Ile Thr Ile Ala Ile Glu Lys Glu Glu
1               5                   10                  15

Cys Arg Phe Cys Ile Ser Ile Asn Thr Thr Trp Cys Ala Gly Tyr Cys
            20                  25                  30

Tyr Thr Arg Asp Leu Val Tyr Lys Asp Pro Ala Arg Pro Lys Ile Gln
        35                  40                  45

Lys Thr Cys Thr Phe Lys Glu Leu Val Tyr Glu Thr Val Arg Val Pro
    50                  55                  60

Gly Cys Ala His His Ala Asp Ser Leu Tyr Thr Tyr Pro Val Ala Thr
65                  70                  75                  80

Gln Cys His Cys Gly Lys Cys Asp Ser Asp Ser Thr Asp Cys Thr Val
                85                  90                  95

Arg Gly Leu Gly Pro Ser Tyr Cys Ser Phe Gly Glu Met Lys Glu
            100                 105                 110

<210> 5 
<211> 6186 
<212> DNA 

<213> Artificial sequence 
<220> 

<221> exon 

<222> (1225)..(1572)

<223> Coding sequence for human FSH-alpha 



<400> 5 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 

ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 

cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 

ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240 

gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300 

tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360 

cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420 

attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480 

atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540 

atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 

tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660

actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720

aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780

gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840

ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gctggctagc 900

ttattgcggt agtttatcac agttaaattg ctaacgcagt cagtgcttct gacacaacag 960

tctcgaactt aagctgcagt gactctctta aggtagcctt gcagaagttg gtcgtgaggc 1020

actgggcagg taagtatcaa ggttacaaga caggtttaag gagaccaata gaaactgggc 1080

ttgtcgagac agagaagact cttgcgtttc tgataggcac ctattggtct tactgacatc 1140

cactttgcct ttctctccac aggtgtccac tcccagttca attacagctc ttaaaagctt 1200

ggtaccgagc tcggatccgc cacc atg gac tac tac cgc aag tac gcc gcc 1251
                           Met Asp Tyr Tyr Arg Lys Tyr Ala Ala



atc ttc ctg gtg acc ctg agc gtg ttc ctg cac gtg ctg cac agc gcc 1299
Ile Phe Leu Val Thr Leu Ser Val Phe Leu His Val Leu His Ser Ala
10                  15                  20                  25

ccc gac gtg cag gac tgc ccc gag tgc acc ctg cag gag aac ccc ttc 1347
Pro Asp Val Gln Asp Cys Pro Glu Cys Thr Leu Gln Glu Asn Pro Phe
                30                  35                  40

ttc agc cag ccc ggc gcc ccc atc ctg cag tgc atg ggc tgc tgc ttc 1395
Phe Ser Gln Pro Gly Ala Pro Ile Leu Gln Cys Met Gly Cys Cys Phe
            45                  50                  55

agc cgc gcc tac ccc acc ccc ctg cgc agc aag aag acc atg ctg gtg 1443
Ser Arg Ala Tyr Pro Thr Pro Leu Arg Ser Lys Lys Thr Met Leu Val
        60                  65                  70

cag aag aac gtg acc agc gag agc acc tgc tgc gtg gcc aag agc tac 1491
Gln Lys Asn Val Thr Ser Glu Ser Thr Cys Cys Val Ala Lys Ser Tyr
    75                  80                  85

aac cgc gtg acc gtg atg ggc ggc ttc aag gtg gag aac cac acc gcc 1539
Asn Arg Val Thr Val Met Gly Gly Phe Lys Val Glu Asn His Thr Ala
90                  95                  100                 105

tgc cac tgc agc acc tgc tac tac cac aag agc taatctagag ggcccgttta 1592
Cys His Cys Ser Thr Cys Tyr Tyr His Lys Ser
                110                 115
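
The exon coordinates in field <222> and the codon-by-codon translation shown above can be cross-checked with a few lines of code. The following is a sketch only: the helper names are hypothetical, the codon table is the standard genetic code written in the conventional TCAG order, and only the first nine codons of the FSH-alpha coding sequence are used as sample input.

```python
BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def feature_length(start, end):
    """Length in nucleotides of a feature given inclusive 1-based coordinates."""
    return end - start + 1


def translate(dna):
    """Translate a coding sequence (spaces allowed) into one-letter amino acids."""
    dna = dna.upper().replace(" ", "")
    if len(dna) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First nine codons of the FSH-alpha coding sequence in SEQ ID NO:5.
FIRST_CODONS = "atg gac tac tac cgc aag tac gcc gcc"

if __name__ == "__main__":
    print(feature_length(1225, 1572) // 3)  # codons in the FSH-alpha exon
    print(translate(FIRST_CODONS))
```

The exon (1225)..(1572) spans 348 nucleotides, i.e. 116 codons, which agrees with the 116-residue precursor of SEQ ID NO:1; the stop codon taa follows immediately after position 1572.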


aacccgctga tcagcctcga ctgtgccttc tagttgccag ccatctgttg tttgcccctc 1652

ccccgtgcct tccttgaccc tggaaggtgc cactcccact gtcctttcct aataaaatga 1712

ggaaattgca tcgcattgtc tgagtaggtg tcattctatt ctggggggtg gggtggggca 1772

ggacagcaag ggggaggatt gggaagacaa tagcaggcat gctggggatg cggtgggctc 1832

tatggcttct gaggcggaaa gaaccagctg gggctctagg gggtatcccc acgcgccctg 1892

tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc agcgtgaccg ctacacttgc 1952

cagcgcccta gcgcccgctc ctttcgcttt cttcccttcc tttctcgcca cgttcgccgg 2012

ctttccccgt caagctctaa atcggggcat ccctttaggg ttccgattta gtgctttacg 2072

gcacctcgac cccaaaaaac ttgattaggg tgatggttca cgtagtgggc catcgccctg 2132

atagacggtt tttcgccctt tgacgttgga gtccacgttc tttaatagtg gactcttgtt 2192

ccaaactgga acaacactca accctatctc ggtctattct tttgatttat aagggatttt 2252

ggggatttcg gcctattggt taaaaaatga gctgatttaa caaaaattta acgcgaatta 2312

attctgtgga atgtgtgtca gttagggtgt ggaaagtccc caggctcccc aggcaggcag 2372

aagtatgcaa agcatgcatc tcaattagtc agcaaccagg tgtggaaagt ccccaggctc 2432

cccagcaggc agaagtatgc aaagcatgca tctcaattag tcagcaacca tagtcccgcc 2492

cctaactccg cccatcccgc ccctaactcc gcccagttcc gcccattctc cgccccatgg 2552

ctgactaatt ttttttattt atgcagaggc cgaggccgcc tctgcctctg agctattcca 2612

gaagtagtga ggaggctttt ttggaggcct aggcttttgc aaaaagctcc cgggagcttg 2672

tatatccatt ttcggatctg atcagcacgt gatgaaaaag cctgaactca ccgcgacgtc 2732

tgtcgagaag tttctgatcg aaaagttcga cagcgtctcc gacctgatgc agctctcgga 2792

gggcgaagaa tctcgtgctt tcagcttcga tgtaggaggg cgtggatatg tcctgcgggt 2852

aaatagctgc gccgatggtt tctacaaaga tcgttatgtt tatcggcact ttgcatcggc 2912

cgcgctcccg attccggaag tgcttgacat tggggaattc agcgagagcc tgacctattg 2972

catctcccgc cgtgcacagg gtgtcacgtt gcaagacctg cctgaaaccg aactgcccgc 3032

tgttctgcag ccggtcgcgg aggccatgga tgcgatcgct gcggccgatc ttagccagac 3092

gagcgggttc ggcccattcg gaccgcaagg aatcggtcaa tacactacat ggcgtgattt 3152

catatgcgcg attgctgatc cccatgtgta tcactggcaa actgtgatgg acgacaccgt 3212

cagtgcgtcc gtcgcgcagg ctctcgatga gctgatgctt tgggccgagg actgccccga 3272

agtccggcac ctcgtgcacg cggatttcgg ctccaacaat gtcctgacgg acaatggccg 3332

cataacagcg gtcattgact ggagcgaggc gatgttcggg gattcccaat acgaggtcgc 3392

caacatcttc ttctggaggc cgtggttggc ttgtatggag cagcagacgc gctacttcga 3452

gcggaggcat ccggagcttg caggatcgcc gcggctccgg gcgtatatgc tccgcattgg 3512

tcttgaccaa ctctatcaga gcttggttga cggcaatttc gatgatgcag cttgggcgca 3572

gggtcgatgc gacgcaatcg tccgatccgg agccgggact gtcgggcgta cacaaatcgc 3632

ccgcagaagc gcggccgtct ggaccgatgg ctgtgtagaa gtactcgccg atagtggaaa 3692

ccgacgcccc agcactcgtc cgagggcaaa ggaatagcac gtgctacgag atttcgattc 3752

caccgccgcc ttctatgaaa ggttgggctt cggaatcgtt ttccgggacg ccggctggat 3812


gatxctccag 


cgcggggaxc 


ccaxgc ngga 


^ -4~* *+— ^— 

g l uc x ticgcc 


caccccaac l 


TigxTta txyc 


JO / £. 


agcttataat ggttacaaat aaagcaatag catcacaaat ttcacaaata aagcattttt 3932

ttcactgcat tctagttgtg gtttgtccaa actcatcaat gtatcttatc atgtctgtat 3992

accgtcgacc tctagctaga gcttggcgta atcatggtca tagctgtttc ctgtgtgaaa 4052

ttgttatccg ctcacaattc cacacaacat acgagccgga agcataaagt gtaaagcctg 4112
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aaatacctaa 


taaataaact 

Va VA LA. LA L> V-J LA V-J \> V— 


aactcacatt 

LA LA V_- L VLlLjfLA V. L- 


a at t a c a "t "t a 

LA LA L_ L- LA ^4 L- L* L^J 


ca etc acta c 

L— VA L< L_ V** LA L^ L» LA v» 


ccactttcca 

V* V- VA V- Lw L~ L> Vp- V-« LA 


4172 


atcaaaaaac 

" ^- *»-yyy*"* 


ctgtcgtgcc 


agctgcatta 


atgaatcggc 


caacacacaa 

^ "-^ ^- ZJ ZJ ZJ ZJ 


aaaaaaacoa 

yywywyy^-yy 


4232 


tttgcgtatt 


aaacactctt 

y yy y 


ccgcttcctc 


gctcactzgac 


"tcgctgcgct 


eggtegt'teg 


4292 


actacaaccia 


acaatatcaa 


ctcactcaaa 


aacaataalza 


eggtztatcca 


cagaatcagg 


4352 


aaataacaca 

va la la v. la la v«» va v* la 


aaaaaaaaca 

VA VA VA LA VA Vj LA LA \_* LA 


tataaacaaa 

V- VA L- LJ LA LA L> LA LA LA 


aaaccaacaa 

LA LA VA L^ \_ LA LA V_- LA LA 


aaaaccaaoa 

LA LA LA VA V- V_. LA LA VA LA, 


accataaaaa 

LA V— L— VA LA LA LA VA LA 


4412 


aaccacatta 


ctaacatttt 


"tccataaoct" 


ccacccccct 


aacaaacatc 

y u v> y u. y v> u. 


acaaaaatca 


4472 


acactcaaat 

LA V— VJ L_* U L- 


^"y^yy L yy 


aaaacccaac 

LA LA LA V- LA LA L*. 


aaaactatiaa 

LA LA LA LA L* L- LA L- LA LA 


aaataccaaa 

LAVA LA V» LA L_ V— 1 W vj VA 


cattilzccccc 

V* VA Lw Vw V-, \_ L« L_ V_ 


4532 


taaaaactcc 


CtCCItClCCICt 

\mm V- VA t— VA 1 LA Lw L* 


ctcctalitcc 

L- Vp- Lo Vp» LA W, L- Vh 


QSLCCCtOCCQ 

VA LA X— *— LJ Lf LA 


cttaccaaat 

V^ L« L» LA V«* V» VA VA LA V> 


acctatccac 

LA Vp. L_ V. VA V# V— V- VA V 


4592 


ctttctccct 


tcaaaaaacci 


taacactttc 


tcaatgctca 


cgctgtaggt 


atctcagttc 


4652 


qqtqtaqqtc 

zj y zj ^^zp zJ 


gttcgctcca 


aqctqqqctq 

zj if 23 3 ^ ZJ 


tqtqcacqaa 


ccccccgttc 


aqcccqaccq 


4712 


ctgcgcctta 


tccggtaact 


atcgtcttga 


gtccaacccg 


gtaagacacg 


acttatcgcc 


4772 


actaacaaca 

LA Va* L- VA >^ Vn» la. va V— LA. 


accactaata 

Lj V*_ Vp- LA Vr» V. VA V«J L- LA 


acaaaattaa 

LA» V- LA LA LA LA L- L> LA LA 


caaaacaaaa 

ZJ — Zy Zs ZJ 


tatataaaca 

v.u ^»y *- u yy ^*y 


atactacaaa 


4832 


CIttCttaaaa 

VA 1 W V* V* Lp, V- \J LA LA, VA 


taataaccta 


actacaacta 


cactaaaaaa 


acaatattta 


atatrctacac 

y <_ d V- ^— y * — y 


4892 


tctgctgaag 


ccaattacct 


"tcaaaaaaaa 

L» L— U LA LA LA LA LA LA LA 


aattaataac 

LA L» L- LA LA L- LA LA 


"tcttaatcca 


acaaacaaac 

VA L, LA LA LA L_« LA LA LA V_ 


4952 


caccgctggt 


aacaataatt 

*-*y *-y y *-yy *- *- 


tttttattta 

Ljp Lm L> L- L> LA L. W L- M 


caaacaacaa 

v.* la la la l^ la la LA VA 


attacacaca 

LA L- V- LA V-» LA V-# LA V- LA 


aaaaaaaaaa 

yu.uu.uuuuy y 


5012 


atctcaagaa 


gatcctttga 


t:ct"t"ttctac 


aaaatctaac 


actcaataaa 

y*— ^- y "-y y u 


acaaaaactc 

LA V»- VA LA LA VA LA V— L^ V_ 


5072 


acattaaaaa 


attttggtca 


tgagattatc 


aaaaaaaatc 

LA LA LA LA LA LA LA LA L- V,* 


ttcacctaga 


"tccttttaaa 

V- L— V-> L V L V LA LA LA 


5132 


ttaaaaatga 


agttttaaat 


caatctaaaa 

Vf- la la L_ V_- L- la la la la 


"tatatataaa 

V- LA L- LA L- LA V- VA LA VA 


"taaacttaat 

L- LA LA LA V— L- I— LA \A L. 


ctaacaatta 

i_y ux— uy V— i_ u 


5192 


ccaatgctta 


atcaataaaa 

u L y u yy 


cacctatctc 


aacaatctat 

LA LA X_i LA LA L- V* L^ VA L». 


ctatttcatt 

V- L- LA \mm Km V- V. VA La L- 


catccataat 

V-> LA V* Va, L. LA L- LA VA V- 


5252 


tgcctgactc 


cccgtcgtgt 


agataadiac 


aa*tacaaaaa 

y w Lw ^yyy w y 


ggcttzaccat 


ctggccccag 


5312 


tgctgcaatg 


ataccgcgag 


acccacgctc 


accggctcca 


gatzttaticag 


caataaacca 


5372 


accaaccaaa 

ZJ zJ ZJ ZJ 


aaaaccaaac 

yyy ^ v *y u y u 


acaaaaataa 

y^ w y wu y z)zJ 


"tcctgcaact: 


"ttatccgcct 


ccatccagtc 


5432 


tattaattgt 


taccaaaaaa 

ZJ ZJ zJ ZJ y 


ctagagtaag 


tagttcgcca 


gt"taa"tag"t*t 


tgcgcaacgt 


5492 


tgttgccatt 


gctacaggca 


i_ \— \~j *~ZJZJ ZJ 


acgctcgtcg 


i_ v- «_y y t-ui-yy 


cttcattcaa 

L« V* L-. V— LA La V. V^. LAVA 


5552 


ctccaattcc 

V- V- • VA WJ V- L- V— V- 


caacaatcaa 

L-L*LLLnMLA v. v—lala 


aacaaat"tac 

Vj LA Vh> LA LA LA L- LUv. 


ataa*t ccccc 

LA L- VA LA L- V_ Vh V— V— V— 


atattataca 

tx v_ y ih. i_ y i_ y \_ t4 


aaaaaacaat 

u,u.u,uuy v_ y y i_ 


5612 


tagctccttc 


ggtcctccga 




aaataaatta 


accacaatat: 

y^^y ^ w y y 


tat cac teat 


5672 


aattataaca 


acactacata 

^■J V* LA, L. LA V* LA L» LA 


a*t *t ct c t "t a c 


"tatcatacca 

y i~ v_ u. i_ 23 v— v. t*. 


*tcca*taaaa*t 

i_ v_ y Luuyu v. 


acttttctat 


5732 


aactaataaa 

y w >- zj z) *-y *-*y 


tactcaacca 

V* LA L> ViLAL\ VLtfVl 


a a "t c at: "t c"t a 


aaaataatat: 


atacaacaac 

ci k.y v^yyv-ycLv, 


caaattactc 


5792 


ttacccaaca 

*~ i -y v - , - v -yy v -y 


tcaatacaaa 

V, V- LA LA V. LA V^* VH LA Vj 


ataataccac 


acracataac 


aaaactttaa 


aaatactcat 




cattggaaaa cgttcttcgg ggcgaaaact ctcaaggatc ttaccgctgt tgagatccag 5912

ttcgatgtaa cccactcgtg cacccaactg atcttcagca tcttttactt tcaccagcgt 5972

ttctgggtga gcaaaaacag gaaggcaaaa tgccgcaaaa aagggaataa gggcgacacg 6032

gaaatgttga atactcatac tcttcctttt tcaatattat tgaagcattt atcagggtta 6092

ttgtctcatg agcggataca tatttgaatg tatttagaaa aataaacaaa taggggttcc 6152

gcgcacattt ccccgaaaag tgccacctga cgtc 6186

<210> 6 

<211> 5651 

<212> DNA 

<213> Artificial sequence



<220> 

<221> exon 

<222> (1231)..(1617)

<223> Coding sequence for human FSH-beta 



<400> 6 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60

ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120

cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180

ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240

gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300

tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360

cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420

attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480

atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540

atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600

tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660

actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720

aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780

gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840

ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gctggctagc 900

ttattgcggt agtttatcac agttaaattg ctaacgcagt cagtgcttct gacacaacag 960

tctcgaactt aagctgcagt gactctctta aggtagcctt gcagaagttg gtcgtgaggc 1020

actgggcagg taagtatcaa ggttacaaga caggtttaag gagaccaata gaaactgggc 1080

ttgtcgagac agagaagact cttgcgtttc tgataggcac ctattggtct tactgacatc 1140

cactttgcct ttctctccac aggtgtccac tcccagttca attacagctc ttaaaagctt 1200

ggtaccgagc tcggatctat cgatgccacc atg gag acc ctg cag ttc ttc ttc 1254
                                 Met Glu Thr Leu Gln Phe Phe Phe
                                 1               5

ctg ttc tgc tgc tgg aag gcc atc tgc tgc aac agc tgc gag ctg acc 1302
Leu Phe Cys Cys Trp Lys Ala Ile Cys Cys Asn Ser Cys Glu Leu Thr
    10                  15                  20

aac atc acc atc gcc atc gag aag gag gag tgc cgc ttc tgc atc agc 1350
Asn Ile Thr Ile Ala Ile Glu Lys Glu Glu Cys Arg Phe Cys Ile Ser
25                  30                  35                  40

atc aac acc acc tgg tgc gcc ggc tac tgc tac acc cgc gac ctg gtg 1398
Ile Asn Thr Thr Trp Cys Ala Gly Tyr Cys Tyr Thr Arg Asp Leu Val
                45                  50                  55

tac aag gac ccc gcc cgc ccc aag atc cag aag acc tgc acc ttc aag 1446
Tyr Lys Asp Pro Ala Arg Pro Lys Ile Gln Lys Thr Cys Thr Phe Lys
            60                  65                  70

gag ctg gtg tac gag acg gtc cgg gtg ccc ggc tgc gcc cac cac gcc 1494
Glu Leu Val Tyr Glu Thr Val Arg Val Pro Gly Cys Ala His His Ala
        75                  80                  85

gac agc ctg tac acc tac ccc gtg gcc acc cag tgc cac tgc ggc aag 1542
Asp Ser Leu Tyr Thr Tyr Pro Val Ala Thr Gln Cys His Cys Gly Lys
    90                  95                  100

tgc gac agc gac agc acc gac tgc acc gtg cgc ggc ctg ggc ccc agc 1590
Cys Asp Ser Asp Ser Thr Asp Cys Thr Val Arg Gly Leu Gly Pro Ser
105                 110                 115                 120

tac tgc agc ttc ggc gag atg aag gag taactcgaga ctagagggcc 1637
Tyr Cys Ser Phe Gly Glu Met Lys Glu
                125



cgtttaaacc cgctgatcag cctcgactgt gccttctagt tgccagccat ctgttgtttg 1697

cccctccccc gtgccttcct tgaccctgga aggtgccact cccactgtcc tttcctaata 1757

aaatgaggaa attgcatcgc attgtctgag taggtgtcat tctattctgg ggggtggggt 1817

ggggcaggac agcaaggggg aggattggga agacaatagc aggcatgctg gggatgcggt 1877

gggctctatg gcttctgagg cggaaagaac cagctggggc tctagggggt atccccacgc 1937

gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt acgcgcagcg tgaccgctac 1997

acttgccagc gccctagcgc ccgctccttt cgctttcttc ccttcctttc tcgccacgtt 2057

cgccggcttt ccccgtcaag ctctaaatcg gggcatccct ttagggttcc gatttagtgc 2117

tttacggcac ctcgacccca aaaaacttga ttagggtgat ggttcacgta gtgggccatc 2177

gccctgatag acggtttttc gccctttgac gttggagtcc acgttcttta atagtggact 2237

cttgttccaa actggaacaa cactcaaccc tatctcggtc tattcttttg atttataagg 2297

gattttgggg atttcggcct attggttaaa aaatgagctg atttaacaaa aatttaacgc 2357

gaattaattc tgtggaatgt gtgtcagtta gggtgtggaa agtccccagg ctccccaggc 2417

aggcagaagt atgcaaagca tgcatctcaa ttagtcagca accaggtgtg gaaagtcccc 2477

aggctcccca gcaggcagaa gtatgcaaag catgcatctc aattagtcag caaccatagt 2537

cccgccccta actccgccca tcccgcccct aactccgccc agttccgccc attctccgcc 2597

ccatggctga ctaatttttt ttatttatgc agaggccgag gccgcctctg cctctgagct 2657
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at "t c c 3. g aag 


taataaaaaa 


a c "t "t *t *t t "t a a 


"yy^^ *-*-*y y v - 


ttttacaaaa 


aactcccaaa 


2717 


a. a c *t t a t a. t a. 

*-A vi V_ VJ l_ VA- 1^ vA 


tccattttca 

i~ v* X— va v_ i__ \_ vj 


aatctaatca 

V| VA L_ V— I— VA V*. CL 


acacat*a't"ta 

L-\J V. «-y 


acaa"t"taa*tc 

UL.UU L 1>UULL< 


atcaocataa 

la l. ~y 23 23 


2777 


tatatcggca 


tagtataata 


caacaaaata 


aggaactaaa 


ccatggccaa 


gttgaccagt 


2837 


qccqttccaq 

23 23 w w ^* 23 23 


tgctcaccgc 


acacaacatc 


accaaaaccia 

y ^^-yy^-y ^-2323 


"tcgagtitctg 


aaccaaccaa 

23 ^* 23 *■* 23 23 


2897 


ctcgggttct 


cccgggactt 


cQtaaaaaac 


gacttcgccg 


atataatcca 

23 *"23 ^" 23 23 *~ ^ 23 


aaacaacata 

2323 -3 y 23 


2957 


accctattca 


tCSLOCQCQOt 

%m V*» VV VJ V* Vj V* WJ VI V 


ccaaaaccaa 

V« VJ V* V- ■ V« V*. VJ 


y L yy L y v -*-yy 


3LC3.aca.ccct 


23 y ^yyy *-y 


3017 


taaatacaca 


acctaaacaa 


actatacacc 


y ay uyy L.v_yy 


aa a teat ate 

u y y *- ^- 23 ^"23 


cacaaacttc 

V. LA Vrf L3 LA LA V» L- L> 


3077 


caaaacacct 


ccaaaccaac 


cataaccaaa 


atcaacaaac 


»y*-»-y L yyyy 


acaaaaattc 
y *—y yy **y »- l. v. 


3137 


gccctgcgcg 


acccaaccaa 


caactacata 


cacti: cat aa 


ccaaaaaaca 


aaactaacac 

23 23 v.yw.L.wL- 


3197 


gtgctacgag 


atttcgattc 


caccgccgcc 


ttctatgaaa 


aattaaactt 

2323 yyy ^ ^* 


caaaatcatt 

VJ VI lAU, VJ V. V 


3257 


ttccgggacg 


ccggctggat 


gatcctccag 


cacaaaaatc 

y y y yy ^* 


tcatactaaa 


attcttcacc 


3317 


caccccaact 


tgtttattgc 


agcttataat 


ggttacaaat 


aaagcaatag 


catcacaaat 


3377 


ttcacaaata 


aagcattttt 


ttcactgcat 


tctaattata 


atttatccaa 


actcatcaat 


3437 


gtatcttatc 


atatctatat 

VA V* Vj ^— V- V. Vj V. VA V 


accatcaacc 


"tctaactaaa 


acttaacata 

23 23 23 23 


at cat aa tea 

LA L. Lb* LA L. LJ LJ l_ LA 


3497 


tagctgtttc 


ctgtgtgaaa 


■ttattatcca 


ctcacaattc 


cacacaacat 


acaaaccaaa 

LAV-LJLAVJL-.V-LJLJLA 


3557 


agcataaagt 


gtaaagcctg 


aaatacctzaa 


"taaataaact 

23 ^ 23 , -23 t *23 v " 


aactcacatt 


aattacatta 


3617 


cgctcactgc 


ccgctttcca 


atcaaaaaac 

23 23 23 23 


ctgtcgtgcc


agctgcatta


ataaatcaac 

t-A V- VI IA IA V VI VJ V* 


3677 


caacgcgcgg 


aaaaaaacaa 

yy"-y w yy *-yy 


tttgcgtatt 


gggcgctctt


ccgcttcctc 


actcactaac 

VJ V* L IA w \_ Vy VA ^* 


3737 


tcactacact 


cggtcgttcg 


23^* y yy y 


23 *~ 23 Z3 *- 23 


ctcactcaaa 


oocooTaalza 

Vj VI V_- Vf VJ V vA VA> VL 


3797 


cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa 3857


aaggccagga 


accgtaaaaa 


yy y y ^ 23 


ctaacatttt

V-^ V- ^-i V4 V-* \A L L L L. 


tccataaact 


ccacccccct 

\m> v« Vj • v* v« v» v« 


3917 


gacgagcatc 


acaaaaatcg 


acgctcaagt 


caaaaataac 


gaaacccgac 


aaaactataa 

V% VJ VJ vA, V^< V VI V V\> V*. 


3977 


agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg 4037


cttaccggat 


acctgtccgc 


ctttctccct 


"tcaaaaaaca 

yyy wu y ^y 


tggcgctttc


tcaatgctca


4097 


cactataaat 


atctcagttc 


2323 23 tu yy v-w 


attcactcca 


aactaaacta 

23 23 23 23 *~23 


tatacacaaa 

V VI V VJ V VA> N^i VJ IA VA 


4157 


ccccccgttc 


agcccgaccg 


ctacaccttra 

V. \A V— %rt V* Vp* W V\i 


tccaataact 


atcatcttaa 

xA. L. y *— L- L- L«J LA 


otccaaccco 


4217 


ataaaacaca 


acttatcacc 

LA V_ Vh L. LA 1_ * — VJ V_ V. 


actaacaaca 

t4. v_ y y v_ ca y v*ci 


accactaata 


araaaattaa 


c a a a a ca aaa 

L.ayayL.yuyy 


4277 


tatataaaca 


atactacaaa 

L. LJ V- *_ LA. \— UVj LA 


attcttaaaa 

23 ^- ^•y u - t *y 


taataaccta 

v.yy l. y y v_ i_ la, 


actacaacta 

L4. V_ LUV<yy V_ L.CL 


cactaaaaaa 

v,av» L. 0. y ci ca y y 


4337 


acaatattta 

VA VA ~J V IA *— VJ 


atatctacac 

y i_ l_ t- y v_y v- 


tctactaaaa 


ccaattacct 

L.L»Uy L. UUv>L> l— 


tcaaaaaaaa 

L- l_ y y cl cl a. la a. y 


aattaataac 

ca y l. Lyy uciy \- 


4397 


ll i_ i_ y cl l. v_ y 


yL.clclci^clcLcLL. 






IXLtLy LI. L9 


caagcLAgv-cig 


^Hj / 


attacgcgca gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac 4517
gctcagtgga acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc 4577
ttcacctaga tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag 4637
taaacttggt ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt 4697
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ctatrttcatt: 


CSLtlCC3L't3LO't 
ia i— v* v_ ia i— *Ay u 


"tacctaactc 

Ly v.tu (,yu\> u \_ 


u, u-u.y LLy ty l 


aaataactac 

uy u i» wiu\w i>uv 


Qcl'tlS.CQQQSLQ 

y u (A . y y y uy 


4757 


yy v. u «- ci V— v_ ca u 




Ly v. u y u. clgl u y 


ci u ci. v_ v_ y v_ y ci y 


a V— v_ v_ ci v_ y v_ u Vw. 


ca u. y y v* u u> v>ca 


4817 


y cx l u ua u u. a y 


V— Cl Cl l. a. CI a. V. V_ C4. 


nrranrrnna 




yuayaay Lyy 


trrtnraart 

u u, v_ Ly uuuu u 


4877 


1~ 1" 3 1" C rn rrt 

L LCl LLLy L 


rratrrantr 

v_ v_a u v_ u ay i_ v„ 


1~a l-^aal-'frii- 
ua u Laa u Ly u 


1"fifTfinnaari 
u y u, v_ y y y clay 


r1"a nantaan 

u, Layay Laay 


ucxy u v.uy uuci 


4937 


n t" 1" a a 1" a n*f t* 

y u uaa Lu.y u l 


tararaarcit 


t* a t* t~ o r r a Tt* 

v- y u uyv_v_.cii_ u 


art" a raaora 

y v_ uciVw.ciyyv_.ci 


1" c a t* a a t* a t* r 

uuy uyy uy u v. 


a rartratra 

ci u. y v_ u u. y l uy 


4997 


u u tyy ua Lyy 


rt"t*rat"*tran 


rtccaattrc 

v— <— v_. V— y y l l v. 


caacaatraa 


yy v— y v_ i.uv. 


ataatrrrrc 

VA i_ y IA l_ \w V_ V_. V_ V- 


5057 


at"a*t*tat:aca 

*a Ly *— u y u y \>u 


aaaaaacaat 

uuuuuy ^>My u 


t* a a c t* c c 1 1 c 


aa"tcctccaa 


i_ v.. y u uy uv_uvy 


aaat_aaat_"ta 


5117 


accacaatat 

yv-^y *w«y uy u 


tatcac "teal" 

U CA l> V— C*. V_ L \_ a U 


aattataaca 


acactacata 

y v.u_ i- y v_ a uci 


a 1 1 c *t ct t a c 

U U U U. U\w> U L(aU 


u y u v— ci uy v— v_. ia 


5177 


trcataaaat 

uv_u-y u a ci y c.v u 


a r "t t* "t T c T a t" 

y ^ L L L L L- u y U 


aactaataaa 

y au. ty y uy ay 


■tactcaa era 

L d V_ U V_. Cl CJ. U d 


ci y u v_ ci u u v_ uy 


aaaataatat 

ci y ci ca u ia y uy u 


5237 




rnanf trirtr 

u. y ay u Ly l u v_ 


"\rTncccc\acn 

u Lyv-LLyyuy 


u v_ aa uo.Vw.yyy 


ci u ci ci Luuuy u 


occ ar at - aac 

y uuuUu Lay u 


5297 

-J d— ~J 1 


aaaactttaa 

tA y tA tA v_ l i_ ma 


aaatactcat 

c-t c-i y i— y »_, i— »_ *a i— 


cattaaaaaa 

i_ *-yy u ^ u( ^ 


v_ y u >- »— «- u\-y y 


aacaaaaact 


ctcaaaaatc 

•w. L- \_ 4. IA y (A 1_ V- 


5357 


ttaccgctgt tgagatccag ttcgatgtaa cccactcgtg cacccaactg atcttcagca 5417
tcttttactt tcaccagcgt ttctgggtga gcaaaaacag gaaggcaaaa tgccgcaaaa 5477
aagggaataa gggcgacacg gaaatgttga atactcatac tcttcctttt tcaatattat 5537
tgaagcattt atcagggtta ttgtctcatg agcggataca tatttgaatg tatttagaaa 5597
aataaacaaa taggggttcc gcgcacattt ccccgaaaag tgccacctga cgtc 5651



<210> 7 

<211> 6213 

<212> DNA 

<213> Artificial sequence 
<220> 

<221> exon 

<222> (1225)..(1599)

<223> Coding sequence for modified FSH-alpha 



<400> 7 



gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata aaaactttcc 420
attgacgtca ataaataaac tatttacggt aaactgccca cttggcagta catcaagtgt 480
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540
atgcccagta catgacctta taaaactttc ctacttggca gtacatctac gtattagtca 600


t cactattac 


cat aataato 


caattttaac 

*«yy <- *- «-yy v 


aatacat caa 


taaacataaa 


taacaattta 


660 


actcacaaaa 


atttccaaat 


ctccacccca 


ttaacatcaa 


taaaaattta 
c yyy y " 


ttttaacacc 


720 


uaua v— \— n. y 


aaactttcca 

y y c*. v. u u \_ u 




acaactccac 


cccattaaca 


caaataaaca 

^uuu y y y v«y 


780 


ataaacatat 


acaataaaaa 


atctatataa 


acaaaactct 

y \— w y w y v» 


ctaactaact 


aaaaaaccca 


840 


ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa actaactaac 900
ttattgcggt agtttatcac agttaaattg ctaacgcagt cagtgcttct gacacaacag 960
tctcgaactt aagctgcagt gactctctta aggtagcctt gcagaagttg gtcgtgaggc 1020
actgggcagg taagtatcaa ggttacaaga caggtttaag gagaccaata gaaactgggc 1080
ttgtcgagac agagaagact cttgcgtttc tgataggcac ctattggtct tactgacatc 1140
cactttgcct ttctctccac aggtgtccac tcccagttca attacagctc ttaaaagctt 1200


ggtaccgagc tcggatccgc cacc atg gac tac tac cgc aag tac gcc gcc  1251
                           Met Asp Tyr Tyr Arg Lys Tyr Ala Ala
                           1               5

atc ttc ctg gtg acc ctg agc gtg ttc ctg cac gtg ctg cac agc gcc  1299
Ile Phe Leu Val Thr Leu Ser Val Phe Leu His Val Leu His Ser Ala
10                  15                  20                  25

aac atc acc gtt aac atc acc gtg gcc ccc gac gtg cag gac tgc ccc  1347
Asn Ile Thr Val Asn Ile Thr Val Ala Pro Asp Val Gln Asp Cys Pro
                30                  35                  40

gag tgc acc ctg cag gag aac ccc ttc ttc agc cag ccc ggc gcc ccc  1395
Glu Cys Thr Leu Gln Glu Asn Pro Phe Phe Ser Gln Pro Gly Ala Pro
            45                  50                  55

atc ctg cag tgc atg ggc tgc tgc ttc agc cgc gcc tac ccc acc ccc  1443
Ile Leu Gln Cys Met Gly Cys Cys Phe Ser Arg Ala Tyr Pro Thr Pro
        60                  65                  70

ctg cgc agc aag aag acc atg ctg gtg cag aag aac gtg acc agc gag  1491
Leu Arg Ser Lys Lys Thr Met Leu Val Gln Lys Asn Val Thr Ser Glu
    75                  80                  85

agc acc tgc tgc gtg gcc aag agc tac aac cgc gtg acc gtg atg ggc  1539
Ser Thr Cys Cys Val Ala Lys Ser Tyr Asn Arg Val Thr Val Met Gly
90                  95                  100                 105

ggc ttc aag gtg gag aac cac acc gcc tgc cac tgc agc acc tgc tac  1587
Gly Phe Lys Val Glu Asn His Thr Ala Cys His Cys Ser Thr Cys Tyr
                110                 115                 120

tac cac aag agc taatctagag ggcccgttta aacccgctga tcagcctcga  1639
Tyr His Lys Ser
            125

ctgtgccttc tagttgccag ccatctgttg tttgcccctc ccccgtgcct tccttgaccc 1699 
tggaaggtgc cactcccact gtcctttcct aataaaatga ggaaattgca tcgcattgtc 1759
tgagtaggtg tcattctatt ctggggggtg gggtggggca ggacagcaag ggggaggatt 1819
gggaagacaa tagcaggcat gctggggatg cggtgggctc tatggcttct gaggcggaaa 1879
gaaccagctg gggctctagg gggtatcccc acgcgccctg tagcggcgca ttaagcgcgg 1939
cgggtgtggt ggttacgcgc agcgtgaccg ctacacttgc cagcgcccta gcgcccgctc 1999
ctttcgcttt cttcccttcc tttctcgcca cgttcgccgg ctttccccgt caagctctaa 2059
atcggggcat ccctttaggg ttccgattta gtgctttacg gcacctcgac cccaaaaaac 2119
ttgattaggg tgatggttca cgtagtgggc catcgccctg atagacggtt tttcgccctt 2179
tgacgttgga gtccacgttc tttaatagtg gactcttgtt ccaaactgga acaacactca 2239
accctatctc ggtctattct tttgatttat aagggatttt ggggatttcg gcctattggt 2299
taaaaaatga gctgatttaa caaaaattta acgcgaatta attctgtgga atgtgtgtca 2359
gttagggtgt ggaaagtccc caggctcccc aggcaggcag aagtatgcaa agcatgcatc 2419
tcaattagtc agcaaccagg tgtggaaagt ccccaggctc cccagcaggc agaagtatgc 2479
aaagcatgca tctcaattag tcagcaacca tagtcccgcc cctaactccg cccatcccgc 2539
ccctaactcc gcccagttcc gcccattctc cgccccatgg ctgactaatt ttttttattt 2599
atgcagaggc cgaggccgcc tctgcctctg agctattcca gaagtagtga ggaggctttt 2659
ttggaggcct aggcttttgc aaaaagctcc cgggagcttg tatatccatt ttcggatctg 2719
atcagcacgt gatgaaaaag cctgaactca ccgcgacgtc tgtcgagaag tttctgatcg 2779
aaaagttcga cagcgtctcc gacctgatgc agctctcgga gggcgaagaa tctcgtgctt 2839
tcagcttcga tgtaggaggg cgtggatatg tcctgcgggt aaatagctgc gccgatggtt 2899
tctacaaaga tcgttatgtt tatcggcact ttgcatcggc cgcgctcccg attccggaag 2959
tgcttgacat tggggaattc agcgagagcc tgacctattg catctcccgc cgtgcacagg 3019
gtgtcacgtt gcaagacctg cctgaaaccg aactgcccgc tgttctgcag ccggtcgcgg 3079
aggccatgga tgcgatcgct gcggccgatc ttagccagac gagcgggttc ggcccattcg 3139
gaccgcaagg aatcggtcaa tacactacat ggcgtgattt catatgcgcg attgctgatc 3199
cccatgtgta tcactggcaa actgtgatgg acgacaccgt cagtgcgtcc gtcgcgcagg 3259
ctctcgatga gctgatgctt tgggccgagg actgccccga agtccggcac ctcgtgcacg 3319
cggatttcgg ctccaacaat gtcctgacgg acaatggccg cataacagcg gtcattgact 3379
ggagcgaggc gatgttcggg gattcccaat acgaggtcgc caacatcttc ttctggaggc 3439
cgtggttggc ttgtatggag cagcagacgc gctacttcga gcggaggcat ccggagcttg 3499
caggatcgcc gcggctccgg gcgtatatgc tccgcattgg tcttgaccaa ctctatcaga 3559
gcttggttga cggcaatttc gatgatgcag cttgggcgca gggtcgatgc gacgcaatcg 3619
tccgatccgg agccgggact gtcgggcgta cacaaatcgc ccgcagaagc gcggccgtct 3679
ggaccgatgg ctgtgtagaa gtactcgccg atagtggaaa ccgacgcccc agcactcgtc 3739
cgagggcaaa ggaatagcac gtgctacgag atttcgattc caccgccgcc ttctatgaaa 3799
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aattaaactt 

yy*-LyyyL.LL 


caaaatcatt 

VJ VA VA. V* V— V-J V* V» 


ttccaaaaca 

V* \m V* V- \A tA. V»» V^J 


ccaactaaat 


gatcctccag 


cacaaaaatc 

v-yv-yyyyu ^. 


3859 


tcatactaaa 

V- V- vl l» VJ V- \A VA 


attcttcacc 

Vj V* v- v— V- V- v_# vj \— V_ 


caccccaact 


tgtttattgc 


agcttataat 


ggttacaaat 


3919 


iA (A M. vj W C4. (A V-VAVA 


cat cacaaat 

v-U. *— V__ VA V— VA VA VA v. 


ttcacaaata 


aaacattttt 


ttcactgcat


tctagttgtg 


3979 


ntttatc raa 

V_j I— l_ 1— y L_ V— V_ U U 


actcatcaat


atatcttatc 


at at eta tat 

u t_ y — . v_ i_ y v- u. i_ 


accat caacc 

VA V— V* VA 1m V-tf VA VA. V— V- 


tctaactaaa 

V. V— 1— VA VA V_» V- VA VI VA 


4039 




atratnatra 

o. lv>u uy y i_ v_ u 


taactatttc 

Lay v. l y l l. ll 


ctdtotaaaa 

v_, iw y uy i_y uuu 


ttattatcca 

l- Ly L L (A L V— ^->y 


ctcacaattc 

V— V- V_ VA. V- VA VA V. V- V_* 


4099 




araaarcaaa 


aacataaaat 


ataaaaccta 

y Luuuy v»v* uy 


aaa t a cc taa 

yyy *-y luu 


taaataaact 

Lyuy Lyuy v- l 


4159 


aactcacatt 

UUV_ t- L l_ 


aattacatta 

l Ly v»y s3 


cactcactac 


ccactttcca 


atcaaaaaac 

y Lv^yyyuuuv. 


ctatcatacc 

Vj_» V» VV< V| V*V^Vj-fV-t 


4219 


aactacatta 

U.y \_ V> y N_ U V. *_ U 


ataaatcaac 

u Ly uu _. v. y y v. 


caacacacaa 


aaaaaaacaa 

y y«.yuy y v-yy 


tttgcgtatt


gggcgctctt


4279 


ccacttcctc 

y v- l v v. v- v. v» 


actcactaac 

Vj \m V, V— . VA X— < I— VJ V-> 


tcactacact 


caatcattca 

v-y y ^v-y v. «_ y 


actacaacaa 

y x— Lyv-yy»-.yu 


acaatatcaa 


4339 


ctcactcaaa 


aacaataata 

yyv-yy luu lu 


caattatcca 

y y v- fc* i» v» v< i* 


caaaatcaaa 


aaataacaca 

VA VA VA V— VA VA V— VA VA 


ggaaagaaca 


4399 


tataaacaaa 

uy l y u. y v»u.ttci 


aaaccaacaa 

uy y x— v.wy v>uu 


aaaaccaaaa 

uuy y v_ uy y u 


accataaaaa 


aaccacatta 

yy^-^—y y *-y 


ctaacatttt 


4459 


tcrataaact 

l. v. v_ a. Layyv. L. 


v_ v_ y v_ v_ v_ v— v_ v_ i_ 


aacaaacatc 

yuv_.yuyv_.tj. i_v_. 


acaaaaat ca 

V< C4. U. U U. Ll L. V- y 


acactcaaat 

u \— y v.v_uuy l 


caaaaataac 

v.uyuyy Lyy v. 


4519 




aaoactataa 

uy y u v. i— u i— uu 


3LQ3LT3LCC3LOO 

u y u. l. u <_. v.uy y 


catttccccc 


taaaaactcc 

_J v« v_^ v_* 


ctcatacact 

V Vtf V_* VA V- VA V- V-J \__» V- 


4579 


ctcct at tec 


aaccctacca 

y u \_ v- v- i- V- y 


cttaccaaat 


acctatccac 

VA. V— ■ V-» V. VJ V- V_^ V_« VA V_* 


ctttctccct 


tcaaaaaaca 

L v<y y y uuy v-y 


4639 


taacactttc 

L y y y l l v. v- 


tcaatactca 

i— v— uu l y v. i_ v. u 


cactataaat 


atctcaattc 

VA> V- V__ V. V_* VAi VA v- V_i V_# 


aatataaatc 

yy L y Luyy lv- 


gttcgctcca 


4699 


aactaaacta 


tatacacaaa 

v. y i— y u v— y u u 


ccccccattc 

V_- V__. V« V__ V_« V_- V4 v_, v- V— 


aacccaacca 

w.y V— V^, V— VA VA. v_* v_- VA 


ctacacctta 

V-- V- VA V* VA; V_ V_* V* V- VA. 


tccggtaact


4759 


atratcttaa 

u _, v_ y l v_ l l. y u 


at ccaaccca 

y l k_ \— ci a. v- v- y 


ataaaacaca 

y *_uuy uv_uv_y 


act t at cocc 

u. v_ l_ V- U V- y \_ V- 


actaacaaca 

u v. . l y y u y v». u 


accactaata 

VJ V— V-» VA V_» V. VA VA V- VA 


4819 




rananrnann 


tatntaofiro 

Ld uy Layyuy 


atactacaaa
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taataaccta 
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4879 
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tctactaaaa 
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V— V_ VA VA V. V V V* 


4939 


traaaaaaaa 

i— v_ y y u.iAu.u.tiyj 


aattaataac 

a.y U- i_yy tuyv- 


tcttaatcca 


acaaacaaac 

VA V— VA. VA, VA V_w VA. V* VA V- 


caccgctggt 


aacaataatt 

v*y v»yy Lyy v. l 


4999 


tttttattta 

*- l i- v l y l l y 


caaacaacaa 


attacacaca 


aaaaaaaaaa 

VA VA VA VA W VA, VA VA> VA VA 


atctcaagaa 


gatcctttga 


5059 


tcttttctac 


aaaatctaac 


gctcagtgga 


acgaaaactc 


acgttaaggg 


attttggtca 


5119 


taaaattatc 

*— y €-4. y U L — . U L *_. 


aaaaaaoatc 


tt cacctaaa 

L_, l_ V4 V-, *— LA, VA> 


tccttttaaa 


ttaaaaatga 


agttttaaat 


5179 


caatrtaaaa 

L. ClU LV- LUUUy 


tatatataaa 


taaacttaat 

luuuv. i_ Lyy v> 


ctaacaatta 

»— v. y u v. .uy v_ i_ u. 


ccaatactta 

V— V-f VA VA, V_ VA V_ V. v_ VA 


atcaataaaa 

VA V- V-* VA V4 v_ Vj VA VA VA 


5239 


race tat ct c 

V— U X— V_ I— U I- v — L X— 


aacaatctat 


ctatttcatt 


catccataat 

v_ u i- v_ >_ u i_uy v. 


tacctaactc 

V- VA V-* V-* V> VA VA V_* V* V_- 


cccatcatat 

V— V_- V_» VA V» V-* v_ V-^ V- 


5299 


aaataactac 

uyu Lu.u.v_ l_uv_ 


aatacaaaaa 

yc*. i_c*.v_yyy«.y 


aactt accat 

y y v_ u. i_ u v_ v_ u i_ 


ctaaccccaa 

v.. Lyy llll u y 


tactacaata 

Ly v» Ly v-ut* l y 


ataccacaaa 

u Lu_v.yv.yuy 


5359 


acccacactc 

U V— V- U. v_ y V. L V- 


accaactcca 

u\i\>yy v. ■ v_ 


aatttatcaa 

y xa. t_ t_ v. V- \_u y 


caataaacca 

V.UU LUUUV.V.U 


accaaccaaa 

yv_v.uyL.v_.yyu 


aaaaccaaac 


5419 


acaaaaataa 

y ^* *•*• y y ^-yy 


tcctacaact 


ttat ccacct 

(_ u <- v. v— y v_> t • 


ccat ccaatc 

V- V_U <_ L* Luy (_ V> 


tattaattat 

L U L L UU L L y L 


taccaaaaaa 

Lyv.L.yyyuuy 


5479 


ctaaaataaa 


taatt caeca 


attaataatt 

y i_ l. uu i_ u y l. i_ 


tacacaacat 


tattaccatt 

l y L Ly V. L.U L L 


actacaaaca 

\j V_- V- VA V_# VA VJ VA 1 V_* VA 


5539 


tcgtggtgtc acgctcgtcg tttggtatgg cttcattcag ctccggttcc caacgatcaa 5599
ggcgagttac atgatccccc atgttgtgca aaaaagcggt tagctccttc ggtcctccga 5659
tcgttgtcag aagtaagttg gccgcagtgt tatcactcat ggttatggca gcactgcata 5719
attctcttac tgtcatgcca tccgtaagat gcttttctgt gactggtgag tactcaacca 5779
agtcattctg agaatagtgt atgcggcgac cgagttgctc ttgcccggcg tcaatacggg 5839



WO 01/58493 



14 



PCT/DK01/00090



ataataccgc gccacatagc agaactttaa aagtgctcat cattggaaaa cgttcttcgg 5899 

ggcgaaaact ctcaaggatc ttaccgctgt tgagatccag ttcgatgtaa cccactcgtg 5959 

cacccaactg atcttcagca tcttttactt tcaccagcgt ttctgggtga gcaaaaacag 6019 

gaaggcaaaa tgccgcaaaa aagggaataa gggcgacacg gaaatgttga atactcatac 6079 

tcttcctttt tcaatattat tgaagcattt atcagggtta ttgtctcatg agcggataca 6139 

tatttgaatg tatttagaaa aataaacaaa taggggttcc gcgcacattt ccccgaaaag 6199 

tgccacctga cgtc 6213 
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